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A MATHEMATICAL MODEL FOR SIMPLE LEARNING 
BY ROBERT R. BUSH AND FREDERICK MOSTELLER 


Harvard University 2 


Introduction 


Mathematical models for empirical 
phenomena aid the development of a 
science when a sufficient body of quan- 
titative information has been accumu- 
lated. This accumulation can be used 
to point the direction in which models 
should be constructed and to test 
the adequacy of such models in their 
interim states. Models, in turn, fre- 
quently are useful in organizing and 
interpreting experimental data and in 
suggesting new directions for experi- 
mental research. Among the branches 
of psychology, few are as rich as learn- 
ing in quantity and variety of available 


data necessary for model building.” 


Evidence of this fact is provided by 
the numerous attempts to construct 
quantitative models for learning phe- 
nomena. The most recent contribu- 
tion is that of Estes (2). 

In this paper we shall present the 
basic structure of a new mathematical 
model designed to describe some simple 
learning situations. We shall focus 
attention on acquisition aid extinction 
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in experimental arrangements using 
straight runways and Skinner boxes, 
though we believe the model is more 
general; we plan to extend the model 
in order to describe multiple-choice 
problems and experiments in generali- 
zation and discrimination in later 
papers. Wherever possible we shall 
discuss the correspondence between 
our model and the one being developed 
by Estes (2), since striking parallels 
do exist even though many of the 
basic premises differ. Our model is 
discussed and developed primarily in 
terms of reinforcement concepts while 
Estes’ model stems from an attempt 
to formalize association theory. Both 
models, however, may be re-inter- 
preted in terms of other sets of con- 
cepts. This state of affairs is a com- 
mon feature of most mathematical 
models. An example is the particle 
and wave interpretations of modern 
atomic theory. 

We are concerned with the type of 
learning which has been called “‘instru- 
mental conditioning’ (5), “operant 
behavior” or “type R conditioning” 
(10), and not with “classical condi- 
tioning”’ (5), ‘Pavlovian conditioning” 
or “type S conditioning” (10). We 
shall follow Sears (9) in dividing up 
the chain of events as follows: (1) per- 
ception of a stimulus, (2) performance 
of a response or instrumental act, (3) 
occurrence of an environmental event, 
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and (4) execution of a goal response. 
Examples of instrumental responses 
are the traversing of a runway, press- 
ing of a lever, etc. By environmental 
events we mean the presentation of a 
“reinforcing stimulus” (10) such «> 
food or water, but we wish to include 
in this category electric shocks and 
other forms of punishment, removal 
of the animal from the apparatus, the 
sounding of a buzzer, etc. Hence any 
change in the stimulus situation which 
follows an instrumental response is 
called an environmentalevent. A goal 
response, such as eating food or drink- 
ing water, is not necessarily involved 
in the chain. It is implied, however, 
that the organism has a motivation 
or drive which corresponds to some 
goal response. Operationally speak- 
ing, we infer a state of motivation 
from observing a goal response. 


Probabilities and How They Change 


As a measure of behavior, we have 
chosen the probability, p, that the 
instrumental response will occur dur- 
ing a specified time, 4. This proba- 
bility will change during conditioning 
and extinction and will be related to 
experimental variables such as latent 
time, rate, and frequency of choices. 
The choice of the time interval, h, will 
be discussed later. We conceive that 
the probability, p, is increased or de- 
creased a small amount after each 
occurrence of the response and that 
the determinants of the amount of 
change in p are the environmental 
events and the work or effort expended 
in making the response. In addition, 
of course, the magnitude of the change 
depends upon the properties of the 
organism and upon the value of the 
probability before the response oc- 
curred. For example, if the proba- 
bility was already unity, it could not 
be increased further. 

Our task, then, is to describe the 
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change in probability which occurs 
after each performance of the response 
being studied. We wish to express 
this change in terms of the probability 
immediately prior to the occurrence 
of the response and so we explicitly 
assume that the change is independent 
of the still earlier values of the proba- 
bility. For convenience in describing 
the step-wise change in probability, 
we introduce the concept of a mathe- 
matical opera*or. The notion is ele- 
mentary and in no way mysterious: 
an operator Q when applied to an 
operand p yields a new quantity Qp 
(read Q operating on p). Ordinary 
mathematical operations of addition, 
multiplication, differentiation, etc., 
may be defined in terms of operators. 
For the present purpose, we are inter- 
ested in a class of operators Q which 
when applied to our probability p will 
give a new value of probability Qp. 
As mentioned above, we are assuming 
that this new probability, Qp, can be 
expressed in terms of the old value, p. 
Supposing Qp to be a well-behaved 
function, we can expand it as a power 
series in p: 


Op =atapt+ap?+--- (1) 


where dp, a1, @2, - + - are constants inde- 
pendent of p. In order to simplify the 
mathematical analysis which follows, 
we shall retain only the first two terms 
in this expansion. Thus, we are as- 
suming that we can employ operators 
which represent a linear transforma- 
tion on p. If the change is small, one 
would expect that this assumption 
would provide an adequate first ap- 
proximation. Our operator Q is then 
completely defined as soon as we 
specify the constants a and a; this 
is the major problem at hand. For 
reasons that will soon be apparent, we 
choose to let a = a and a4; = 1—a—6. 
This choice of parameters permits us 
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to write our operator in the form 
Qp=p+ali—p)—bp. (2) 


This is our basic operator and equation 
(2) will be used as the cornerstone for 
our theoretical development. To main- 
tain the probability between 0 and 1, 
the parameters a and 5 must also lie 
between 0 and 1. Since a is positive, 
we see that the term, a(1 — p), of 
equation (2) corresponds to an incre- 
ment in p which is proportional to the 
maximum possible increment, (1 — ). 
Moreover, since d is positive, the term, 
— bp, corresponds to a decrement in p 
which is proportional to the maximum 
possible decrement, —~. Therefore, 
we can associate with the parameter a 
those factors which always increase 
the probability and with the param- 
eter b those factors which always de- 
crease the probability. It is for these 
reasons that we rewrcte our operator 
in the form given in equation (2). 
We associate the event of presenting 
a reward or other reinforcing stimulus 


with the parameter a, and we assume 
that a = 0 when no reward is given 


as in experimental extinction. With 
the parameter }, we associate events 
such as punishment and the work 
required in making the response. (See 
the review by Solomon [11] of the 
influence of work on behavior.) In 
many respects, our term, a(1 — p), 
corresponds to an increment in ‘‘ex- 
citatory potential” in Hull’s theory 
(6) and our term, — dp, corresponds to 
an increment in Hull’s “inhibitory 
potential.” 

In this paper, we make no further 
attempt to relate our parameters, a 
and 5, to experimental variables such 
as amount of reward, amount of work, 
strength of motivation, etc. In com- 
paring our theoretical results with 
experimental data, we will choose 
values of a and b which give the best 
fit. In other words, our model at the 
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present time is concerned only with 
the form of conditioning and extinc- 
tion curves, not with the precise values 
of parameters for particular conditions 
and particular organisms. 


Continuous Reinforcement and 
Extinction 

Up to this point, we have discussed 
only the effect of the occurrence of a 
response upon the probability of that 
response. Since probability must be 
conserved, #.¢., since in a time interval 
han organism will make some response 
or no response, we must investigate 
the effect of the occurrence of one 
response upon the probability of an- 
other response. In a later paper, we 
shall discuss this problem in detail, 
but for the present purpose we must 
include the following assumption. We 
conceive that there are two general 
kinds of responses, overt and non- 
overt. The overt responses are sub- 
divided into classes A, B, C, etc. If 
an overt response A occurs and is 
neither rewarded nor punished, then 
the probability of any mutually ex- 
clusive overt response B is not changed. 
Nevertheless, the probability of that 
response A is changed after an occur- 
rence on which it is neither rewarded 
nor punished. Since the total proba- 
bility of all responses must be unity, 
it follows that the probability ga‘ned 
or lost by response A must be compen- 
sated by a corresponding loss or gain 
in probability of the non-overt re- 
sponses. This assumption is impor- 
tant in the analysis of experiments 
which use a runway or Skinner box, 
for example. In such experiments a 
single class of responses is singled out 
for study, but other overt responses 
can and do occur. We defer until a 
later paper the discussion of experi- 
ments in which two or more responses 
are reinforced differentially. 

With the aid of our mathematical 
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operator of equation (2) we may now 
describe the progressive change in the 
probability of a response in an experi- 
ment such as the Graham-Gagné run- 
way (3) or Skinner box (16) in which 
the same environmental events follow 
each occurrence of the response. We 
need only apply our operator Q re- 
peatedly to some initial value of the 
probability ». Each application of 
the operator corresponds to one occur- 
rence of the response and the sub- 
sequent environmental events. The 
algebra involved in these manipula- 
tionsis straightforward. Forexample, 
if we apply Q to p twice, we have 


Op = O(0p) =a+(1—a—b)Qp 
=a+(1—a-—)) 
X[e+(1-—a—b)p]. (3) 


Moreover, it may be readily shown 
that if we apply Q to p successively n 
times, we have 


a a 
Ovnts> (55-2) 


xX (l1-—a—b). (4) 


Provided a and 6b are not both zero or 
both unity, the quantity (1 — a — b)* 
tends to an asymptotic value of zero 
as m increases. Therefore, Q*p ap- 
proaches a limiting value of a/(a + d) 
as m becomes large. Equation (4) 
then describes a curve of acquisition. 
It should be noticed that the asymp- 
totic value of the probability is not 
necessarily either zero or unity. For 
example, if a = b (speaking roughly 
this implies that the measures of re- 
ward and work are equal), the ultimate 
probability of occurrence in time h of 
the response being studied is 0.5. 
Since we have assumed that a = 0 
when no reward is given after the 
response occurs, we may describe an 
extinction trial by a special operator 
E which is equivalent to our operator 
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Q of equation (2) with @ set equal to 
zero: 

Ep=p-—bp=(1—b)p. (5) 
It follows directly that if we apply 


this operator E to p successively for n 
times we have 


Enp =,(1 — 6)". (6) 


This equation then describes a curve 
of experimental extinction. 


Probability, Latent Time, and Rate 


Before the above results on continu- 
ous reinforcement and extinction can 
be compared with empirical results, 
we must first establish relationships 
between our probability, ~, and ex- 
perimental measures such as latent 
time and rate of responding. In order 
to do this, we must have a model. 
A simple and useful model is the one 
described by Estes (2). Let the ac- 
tivity of an organism be described by a 
sequence of responses which are inde- 
pendent of one another. (For this 
purpose, we consider doing ‘‘nothing”’ 
to be a response.) The probability 
that the response or class of responses 
being studied will occur first is p. 
Since we have already assumed that 
non-reinforced occurrences of other 
responses do not affect p, one may 
easily calculate the mean number of 
responses which will occur before the 
response being studied takes place. 
Estes (2) has presented this calcula- 
tion and shown that the mean number 
of responses which will occur, includ- 
ing the one being studied, is simply 1/p. 
In that derivation it was assumed that 
the responses were all independent of 
one another, i.e., that transition prob- 
abilities between pairs of responses are 
the same for all pairs. This assump- 
tion is a bold one indeed (it is easy to 
think of overt responses that cannot 
follow one another), but it appears to 
us that any other assumption would 
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require a detailed specification of the 
many possible responses in each ex- 
perimental arrangement being consid- 
ered. (Miller and Frick [8] have 
attempted such an analysis for a par- 
ticular experin..nt.) It is further 
assumed that ev. ¥ response requires 
the same amount of time, A, for its 
performance. The mean latent time, 
then, is simply 4 times the mean num- 
ber of responses which occur on a 
“trial’’: 

bane: (7) 
p 
The time, /, required for each response 
will depend, of course, on the organism 
involved and very likely upon its 
strength of drive or motivation. 

The mean latent time, L, is ex- 
pressed in terms of the probability, p, 
by equation (7), while this probability 
is given in terms of the number of 
trials, m, by equation (4). Hence we 
may obtain an expression for the mean 
latent time as a function of the num- 
ber of trials. It turns out that this 
expression is identical to equation (4) 
of Estes’ paper (2) except for differ- 
ences in notation. (Estes uses T in 
place of our m; our use of a difference 
equation rather than of a differential 
equation gives us the term (1 — a — 0) 
instead of Estes’ e-*.) Estes fitted 
his equation to the data of Graham 
and Gagné (3). Our results differ 
from Estes’ in one respect, however: 
the asymptotic mean latent time in 
Estes’ model is simply 4, while we 


obtain . 
Leow n(2+*) (8) 


This equation suggests that the final 
mean latent time depends on the 
amount of reward and on the amount 
of required work, since we have as- 
sumed that a and } depend on those 
two variables, respectively. This con- 
clusion seems to agree with the data 
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.of Grindley (4) on chicks and the data 


of Crespi (1) on white rats. 

Since equation (7) is an expression 
for the mean time between the end of 
one response of the type being studied 
and the end of the next response of the 
type being studied, we may now cal- 
culate the mean rate of responding in 
a Skinner-box arrangement. If é rep- 
resents the mean time required for the 
occurrence of m responses, measured 
from some arbitrary starting point, 
then each occurrence of the response 
being studied adds an increment in i 
as follows: ’ 

Ai oh 

ao’ (9) 
If the increments are sufficiently small, 
we may write them as differentials and 
obtain for the mean rate of responding 

dn 

@ ~~ 
where w = 1/h. We shall call w the 
“activity level” and by definition w 
is the maximum rate of responding 
which occurs when p = 1 obtains. 


(10) 


The Free-Responding Situation 


In free-responding situations, such 
as that in Skinner box experiments, one 
usually measures rate of responding or 
the cumulative number of responses 
versus time. To obtain theoretical 
expressions for these relations, we first 
obtain an expression for the proba- 
bility » as a function of time. From 
equation (2), we see that if the re- 
sponse being studied occurs, the change 
in probability is Ap = a(1 — p) — dp. 
We have already assumed that if other 
responses occur and are not reinforced, 
no change in the probability of occur- 
rence of the response being studied will 
ensue. Hence the expected change in 
probability during a time interval h 
is merely the change in probability 
times the probability » that the re- 
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sponse being studied occurs in that. 


time interval: 
Expected (Ap) 
= p{a(i—p) — dp}. (11) 


The expected rate of change of proba- 
bility with time is then this expression 
divided by the time 4. Writing this 
rate as a derivative we have 


<b = wpla(t — p) — bp} (12) 


where, as already defined, w = 1/h is 
the activity level. This equation is 
easily integrated to give p as an ex- 
plicit function of time ¢. Since equa- 
tion (10) states that the mean rate of 
responding, dn/dt, is w times the prob- 
ability », we obtain after the inte- 
gration 


dn _ wo aV 
dt po(i+u)+[1—po(1+u) je 
(13) 


where we have let u =5b/a. The 
initial rate of responding at ¢ = 0 is 
Vo = who, and the final rate after a 
very long time ¢ is 





dn w w 
ven| | oe (14) 
Equation (13) is quite similar to the 
expression obtained by Estes except 
for our inclusion of the ratio u = b/a. 
The final rate of responding according 
to equation (14), increases with a and 
hence with the amount of reward given 
per response, and decreases with 6 and 
hence with the amount of work per 
response. These conclusions do not 
follow from Estes’ results (2). 

An expression for the cumulative 
number of responses during continu- 
ous reinforcement is obtained by inte- 
grating equation (13) with respect to 
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time t. The result is 


1 
1+u 


xX (1 — em) + ~| - (15) 


{at + Sog Lott +1) 


As the time ¢ becomes very large,. the 
exponentials in equation (15) approach 
zero and m becomes a linear function 
of time. This agrees with equation 
(14) which says that the asymptotic 
rate is a constant. Both equations 
(13) and (15) for rate of responding 
and cumulative number of responses, 
respectively, have the same form as 
the analogous equations derived by 
Estes (2) which were fitted by him to 
data on a bar-pressing habit of rats. 
The essential difference between Estes’ 
results and ours is the dependence, 
discussed above, of the final rate upon 
amount of work and amount of reward 
per trial. f 

We may extend our analysis to give 
expressions for rates and cumulative 
responses during extinction. Since we 
have assumed that a = 0 during ex- 
tinction, we have in place of equa- 
tion (12) 


® « —udph (16) 


dt 


which when integrated for p and mul- 
tiplied by w gives 


dm _ obs 
dt 1 + whpd 


where ~, is the probability at the be- 
ginning of extinction. The rate at the 
beginning of extinction is V, = wp,. 
Hence we may write equation (17) in 
the form 


(17) 


i a, i ee 
dé 1+ Vd*° 

An integration of this equation gives 

for the cumulative number of extinc- 


(18) 








A MATHEMATICAL MopEL For SIMPLE LEARNING 


tion responses 


m = 5 log (1 + V.bt] 


“i + tog (=) . (19) 


This result is similar to the empirical 
equation m = K log t, used by Skinner 
in fitting experimental response curves 
(10). Our equation has the additional 
advantage of passing through the ori- 
gin as it must. 

It may be noted that the logarithmic 
character of equation (19) implies that 
the total number of extinction re- 
sponses, m, has no upper limit. Thus, 
if our result is correct, and indeed if 
Skinner’s empirical equation is correct, 
then there is no upper limit to the 
size of the “reserve”’ of extinction re- 
sponses. For all practical purposes, 
however, the logarithmic variation is 
so slow for large values of the time é, 
it is justified to use some arbitrary 
criterion for the “completion” of ex- 
tinction. We shall consider extinction 
to-be “‘complete”’ when the mean rate 
of responding V has fallen to some 
specified value, Vy. ‘Thus, the “total” 
number of extinction responses from 
this criterion is 


1 Ve 
mr = = log v," (20) 
We now wish to express this ‘‘total” 
number of extinction responses, mr, 
as an explicit function of the number 
of preceding reinforcements, m. The 
only quantity in equation (20) which 
depends upon 2 is the rate, V,, at the 
beginning of extinction. If we assume 
that this rate is equal to the rate at 
the end of acquisition, we have from 
equations (4) and (10) 


Ve = = be = Vaan 


acd ( Vensx ” Vo)(1 ge a b)* (21) 


where we have let 


a 
aes (22) 


and where Vo = wp is the rate at the 
beginning of acquisition. If we now 


Vex @ @ 


substitute equation (21) into equation 
(20), we obtain 


1 Vines Vinax Vo 
mr = — log ee Coneens | S.C 


xX(i-a- ays} . (23) 


b Vy; V; 


This result may be compared with the 
data of Williams (12) obtained by 
measuring the “total’’ number of ex- 
tinction responses after 5, 10, 30 and 
90 reinforcements. From the data, 
the ratio Vinax/V; was estimated to 
be about 5, and the ratio Vo/V; was 
assumed to be about unity. Values 
of a = 0.014and b = 0.026 were chosen 
in fitting equation (23) to the data. 
The result is shown in the figure. 


Fixed Ratio and Random Ratio 
Reinforcement 


In present day psychological lan- 
guage, the term “‘fixed ratio’’ (7) refers 


to the procedure of rewarding every 


kth response in a free-responding situ- 
ation (k = 2,3, ---). Ina “random 
ratio” schedule, an animal is rewarded 
on the average after k responses but the 
actual number of responses per reward 
varies over some specified range. We 
shall now derive expressions for mean 
rates of responding and cumulative 
numbers of responses for these two 
types of reinforcement schedules. If 
we apply our operator Q, of equation 
(2), to a probability p, and then apply 
our operator E, of equation (5), to Op 
repeatedly for (k — 1) times, we obtain 


(E*0)p = (1-0)*"[p+a(1—p) —bp] 
=pta'(i—p)—b’p (24) 
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“Total”’ number of extinction responses as a function of the number of reinforcements. 
Curve plotted from equation (23) with 5 = 0.026, a = 0.014, Vmax = 5Vo, Vy = Vo. 


Data from Williams (12). 


where 
a’=a(1—b)* 


=a{i—(k—1)b4----}Sa (25) 
and 


b’=1—(1—b)* 


=kb 1— stot. kb. (26) 


The symbol = means “approximately 
equal to.”” In the present case the 
exact approach would be to retain the 
primes on a and 6 throughout; how- 
ever the approximations provide a link 
with the previous discussion. The 
approximations on the right of these 
two equations are justified if kbd is 
small compared to unity. Now the 
mean change in p per response will be 
the second and third terms of equation 
(24) divided by k: 


; b’ 
Ap=F(1-~)-> 


(27) 


pad 


(1 — p) — dp. 





This equation is identical to our result 
for continuous reinforcement, except 
that a’/k replaces a and b’/k replaces 5. 

We may obtain a similar result for 
the “random ratio” schedule as fol- 
lows: After any response, the proba- 
bility that Q operates on p is 1/k and 
the probability that E operates on 
p is (1 — 1/k). Hence the expected 
change in ~ per response is 


1 
Expected (Ap) = 7 Qp 


+ (1 — 1/k)Ep — . 


After equations (2) and (5) are inserted 
and the result simplified, we obtain 
from equation (28) 


Expected (Ap) 
=F (1 — p) — bp. (29) 


(28) 


This result is identical to the approxi- 
mate result shown in equation (27) for 
the fixed ratio case. Since both equa- 
tions (27) and (29) have the same 
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form as our result for the continuous 
reinforcement case, we may at once 
write for the mean rate of responding 
an equation identical to equation (13), 
except that a is replaced by a’/k. 
Similarly, we obtain an expression for 
the final rate of responding identical 
to equation (14) except that a is re- 
placed by a’/k. This result is meant 
to apply to both fixed ratio and ran- 
dom ratio schedules of reinforcement. 

In comparing the above result for 
the asymptotic rates with equation 
(14) for continuous reinforcement, we 
must be careful about equating the 
activity level, w, for the three cases 
(continuous, fixed ratio and random 
ratio reinforcements). Since 1/w rep- 
resents the minimum mean time be- 
tween successive responses, it includes 
both the eating time and a “recovery 
time.”” By the latter we mean the 
time necessary for the animal to re- 
organize itself after eating and get in 
a position to make another bar press 
or key peck. In the fixed ratio case, 
presumably the animal learns to look 
for food not after each press or peck, 
as in the continuous case, but ideally 
only after every k response. There- 
fore both the mean eating time and 
the mean recovery time per response 
are less for the fixed ratio case than 
for the continuous case. In the ran- 
dom ratio case, one would expect a 
similar but smaller difference to occur. 
Hence, it seems reasonable to conclude 
that the activity level, w, would be 
smaller for continuous reinforcement 
than for either fixed ratio or random 
ratio, and that w would be lower for 
random ratio than for fixed ratio when 
the mean number of responses per 
reward was the same. Moreover, we 
should expect that w would increase 
with the number of responses per re- 
ward, k. Even if eating time were 
subtracted out in all cases we should 
expect these arguments to apply. 
Without a quantitative estimate of 


321 


the mean recovery time, we see no 
meaningful way of comparing rates of 
responding under continuous reinforce- 
ment with those under fixed ratio and 
random ratio, nor of comparing rates 
under different ratios (unless both 
ratios are large). The difficulty of 
comparing rates under various rein- 
forcement schedules does not seem to 
be a weakness of our model, but rather 
a natural consequence of the experi- 
mental procedure. However, the im- 
portance of these considerations hinges 
upon the orders of magnitude involved, 
and such questions are empirical ones. 


Aperiodic and Periodic Reinforcement 


Many experiments of recent years 
were designed so that an animal was 
reinforced at a rate aperiodic or peri- 
odic in time (7). The usual procedure 
is to choose a set of time intervals, 
Ti, ---, In, Which have a mean value 
T. Some arrangement of this set is 
used as the actual sequence of time 
intervals between rewards. The first 
response which occurs after one of 
these time intervals has elapsed is 
rewarded. 

To analyze this situation we may 
consider k, the mean number of re- 
sponses per reward, to be equal to the 
mean time interval T multiplied by 
the mean rate of responding: 


dn 
k=T == 
Equation (29) for the expected change 
in probability per response is still valid 
if we now consider k to be: a variable 
as expressed by equation (30). Thus, 
the time rate of change of is 


Twp. | (30) 


dp a 
u =7(l — p) — wbp*®. (31) 


With a little effort, this differential 
equation may be integrated from 0 to 
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t to give 
dn 


— = wp 


dt 
w@ (s— 1) + (s+1)Ke-™/? 
“s 1—Ke-elt 





(32) 


where 
2 = 2wTb/a, (33) 
s= Vi+2z, (34) 
K = (1+2p0—s)/(1+2p0+5). (35) 


For arbitrarily large times #, the final 
rate is 


(36) 


dn w 
(F cs - = (s — 1). 
For sufficiently large values of T, 


z becomes large compared to unity 
and we may write approximately 


S = wV2/z = wVa/buT. (37) 
(2). 


Thus, for large values of 7, the final 
rate varies inversely as the square root 
of T. 

Periodic reinforcement is a spe- 
cial case of aperiodic reinforcement 
in which the set of time intervals, 
Ti, «++, Ta, discussed above, éonsists 
of a single time interval, T. Thus, 
all the above equations apply to both 
periodic and aperiodic schedules. One 
essential difference is known, however. 
In the periodic case the animal can 
learn a time discrimination, or as is 
sometimes said, eating becomes a cue 
for not responding for a while. This 
seems to be an example of stimulus 
discrimination which we will discuss 
in a later paper. 


Extinction After Partial Reinforcement 
Schedules 


In the discussion of extinction in 
earlier sections, it may be noted that 
the equations for mean rates and 
cumulative responses depended on the 
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previous reward training only through 
V., the mean rate at the beginning of 
extinction. Hence, we conclude that 
equations (18) and (19) apply to 
extinction after any type of reinforce- 
ment schedule. However, the quan- 
tities V, and 5 in our equations may 
depend very much on the previous 
training. Indeed, if our model makes 
any sense at all, this must be the case, 
for “‘resistance’’ to extinction is known 
to be much greater after partial rein- 
forcement training than after a con- 
tinuous reinforcement schedule (7). 
Since the rate at the start of extinc- 
tion, V., is nearly equal to the rate at 
the end of acquisition, it will certainly 
depend on the type and amount of 
previous training. Howev +, the log- 
arithmic variation in equacions (19) 
and (20) is so slow, it seems clear that 
empirical results demand a dependence 
of b on the type of reinforcement 
schedule which preceded extinction. 
We have argued that d increases with 
the amount of work required per re- 
sponse. We will now try to indicate 
how the required work might depend 
upon the type of reinforcement sched- 
ule, even though the lever pressure or 
key tension is the same. For con- 
tinuous reinforcement, the response 
pattern which is learned by a pigeon, 
for example, involves pecking the key 
once, lowering its head to the food 
magazine, eating, raising its head, and 
readjusting its body in preparation for 
the next peck. This response pattern 
demands a certain amount of effort. 
On the other hand, the response pat- 
tern which is learned for other types 
of reinforcement schedules is quite 
different; the bird makes several key 
pecks before executing the rest of 
the pattern just described. Thus we 
would expect that the average work 
required per key peck is considerably 
less than for continuous reinforcement. 
This would imply that d is larger and 
thus “‘resistance’’ to extinction is less 
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for continuous reinforcement than for 
all other schedules. This deduction 
is consistent with experimental results 
(7). However, this is just part of the 
story. For one thing, it seems clear 
that it is easier for the organism to 
discriminate between continuous rein- 
forcement and extinction; we have not 
handled this effect here. 


Summary 


A mathematical model for simple 
learning is presented. Changes in the 
probability of occurrence of a response 
in a small time h# are described with 
the aid of mathematical operators. 
The parameters which appear in the 
operator equations are related to exper- 
imental variables such as the amount 
of reward and work. Relations be- 
tween the probability and empirical 
measures of rate of responding and 
latent time are defined. Acquisition 
and extinction of behavior habits are 
discussed for the simple runway and 
for the Skinner box. Equations of 
mean latent time as a function of trial 
number are derived for the runway 
problem; equations for the mean rate 
of responding and cumulative numbers 
of responses versus tirhe are derived 
for the Skinner box experiments. An 
attempt is made to analyze the learn- 
ing process with various schedules of 
partial reinforcement in the Skinner 
type experiment. Wherever possible, 
the correspondence between the pres- 
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ent model and the work of Estes (2) 
is pointed out. 
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SECONDARY REINFORCING EFFECTS WITH 
TWO DIFFERENT DRIVES 
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This study was designed to answer 
the following question: Does secondary 
reinforcement, if and when it occurs, 
operate to produce quantitatively dif- 
ferent effects with different drives? The 
two drives selected for investigation 
were hung -r and thirst, and the process 
selected was experimental extinction. 

The concept of secondary reinforce- 
ment occupies an important position in 
Hull’s (11) theoretical system. Stimuli 
which are consistently present simul- 
taneously with the occurrence of condi- 
tions of primary reinforcement are said 
to acquire properties of reinforcement 
themselves. Since the primary gradient 
of reinforcement is considered by Hull 
to be only about 30 seconds, it can be 
seen how pivotal is secondary reinforce- 
ment in maintaining a somewhat ex- 
tended behavior chain. Although the 
concept is perhaps not immediately rec- 
ognizable in the characteristic sign- 
gestalt terminology of Tolman (22), it 
nevertheless does appear in a somewhat 
modified form when Tolman writes that 
“the ‘sign object’ becomes the ‘goal ob- 
ject’” (22, p. 148). That this is in- 
deed secondary reinforcement in sign- 


1 Hull’s view is that “the first secondary re- 
inforcing stimulus acquires its power of rein- 
forcement by virtue of having conditioned to 
it some fractional component of the need re- 
duction of the goal situation, whose occur- 
rence, wherever it takes place, has a specific 
power of reinforcement in a degree propor- 
tionate to the intensity of that occurrence” 
(11, p. 100). 


gestalt terminology is evident from the 
experiment which Tolman cites; this is 
an investigation by Williams (25), who 
found that animals would learn a 14- 
unit multiple T-maze with no incentive 
other than an empty food box from 
which they had previously been fed. In 
Hull’s terminology this box -had ob- 
viously acquired secondary reinforcing 
properties. 

It cannot be said, however, that even 
within the ranks of those who might be 
designated as “Hullians” there is com- 
plete agreement as to the nature of 
secondary reinforcement. Perhaps the 
chief modification is that made by 
Spence, who suggests that possibly no 
primary gradient of reinforcement ex- 
ists; and that, consequently, learning 
involving periods of delay proceeds un- 
der secondary reinforcement exclusively. 
To quote Spence: 


“The hypothesis suggested . . . is that all 
such learning occurs as a result of second- 
ary reinforcement, the action of which is 
conceived to take place immediately upon 
the occurrence of the response” (21, p. 8). 


In connection with the gradient of re- 
inforcement it is perhaps worth men- 
tioning in passing that in his original 
formulation Hull (9) regarded his goal 
gradient hypothesis as a primary prin- 
ciple; subsequently, however, by invok- 
ing the concept of secondary reinforce- 
ment, he transformed this, making it a 
secondary, derivable phenomenon (10). 
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A number of studies have been un- 
dertaken to demonstrate the effects of 
secondary reinforcement; in other stud- 
ies the concept has been invoked to 
make theoretical sense out of osten- 
sibly contradictory phenomena. Con- 
sider, first, some of the latter studies: 
The failure of Warden and Haas (23) 
and of Watson (24) to find differences 
in the rate of learning in delayed and 
non-delayed reward is attributed by 
Hull to their failure to control second- 
ary reinforcement, that is, the animals 
were admitted immediately into the 
food box. Differences in results be- 
tween Hamilton (8) and Wolfe (26) 
(both of whom found learning with de- 
lays up to 20 minutes) and Perin (15), 
whose work constitutes the primary 
basis of Hull’s postulation of a 30 sec- 
ond primary gradient, are also attrib- 
uted by Hull to control of secondary re- 
inforcement L; Perin. Similarly, Denny 
(3) attempts to explain Humphreys’ 
(12) demonstration that experimental 
extinction requires a greater number of 
trials when animals have been rein- 
forced less than 100 per cent of the 
time on the grounds that secondary re- 
inforcement factors were uncontrolled 
on the so-called non-reinforcement runs; 
and a somewhat similar argument is 
adduced by Lawrence and Miller (14) 
with respect to an analogous study by 
Finger (5). 

Among studies usually cited as evi- 
dence for the existence of the phe- 
nomenon of secondary reinforcement 
are those of Bugelski (1) and Skinner 
(20), who showed that response strength 
was affected by the click of the Skinner 
box mechanism. More recently Denny 
(2) presented evidence in support of 
his hypothesis that the use of differ- 
ential end boxes ought to increase 
learning efficiency. Since this improve- 
ment in learning is a theoretical require- 
ment. demanded by (i.e., deducible 
from) the secondary reinforcement for- 
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mulation, Denny interpreted the 20 per 
cent improvement in efficiency actually 
found as evidence for the effects of sec- 
ondary reinforcement. Perkins (16), in 
an ingenious experimental design which 
aimed at eliminating the differential ef- 
fects of secondary reinforcement by sys- 
tematically shifting the arms of the T- 
maze, eliminating external cues, and 
rotating the maze, found significant dif- 
ferences in learning. He interpreted his 
results as evidence that the “extent of 
delay of reinforcement gradient is a 
function of the degree to which secund- 
ary reinforcement is operative” (16, p. 
379). Following Spence, he expressed 
doubt about the existence of a primary 
gradient of reinforcement. Grice (6) 
presented evidence which he interpreted 
as “consistent with a theory which as- 
sumes no primary delay of reinforce- 
ment gradient but accounts for learn- 
ing under delayed reward conditions in 
terms of some type of immediate sec- 
ondary reinforcement” (6, p. 16). The 
study is of additional interest because 
Grice interpreted the data as evidence 
for the acquisition of secondary rein- 
forcement properties by proprioceptive 
stimuli. Such a possibility was previ- 
ously discussed by Spence in the analy- 
sis already cited (20), on the basis of 
experimental findings by Gulde (7) and 
by Riesen (17). 

It must be admitted that the selec- 
tion of experimental extinction as the 
process to be studied is not without 
problems; and this is perhaps particu- 
larly true in the maze situation, where 
there is no clear operant behavior, a 
circumstance which clouds the behavior 
with some ambiguity. Moreover, there 
appears to be no generally accepted cri- 
terion of extinction, and it is at least 
possible that the use of different criteria 
by different investigators may lead to 
apparently conflicting results. Among 
various criteria of extinction reported 
by investigators are two successive re- 
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sponses to the non-preferred side of the 
T-maze (4), a ten-second latency of re- 
sponse (18), and a five-minute latency 
on two successive trials (14). The cri- 
terion selected for the present study 
was based on ihe probability integrals 
of the ¢ distribution; it was considered 
that the criterion of extinction was 
reached when five successive runs satis- 
fied the equation 


k—np 
\npg 


which means, of course, that any prob- 
ability of ¢ greater than .25 is also con- 
sidered to be satisfactory for the ex- 
tinction criterion. 

The subjects used in this investiga- 
tion were 32 albino rats about 90 days 
old which had not previously been used 
for experimental purposes. They were 
assigned at random to groups as will be 
indicated subsequently. The apparatus 
consisted of a single unit T-maze with 
white and black compartments as shown 
in the figure, 16 metal food cups painted 
white inside and outside, 16 metal food 
cups painted black, 16 white-lined food 
boxes, and 16 black-lined food boxes. 
tn the preliminary training period all 


Wrti1aM SEEMAN AND Nancy K. KJenaas 


animals were given the opportunity to 
adapt to a 24-hour maintenance sched- 
ule for a week before training. Sixteen 
of the animals were put on a schedule 
of water deprivation and 16 on a sched- 
ule of food deprivation. Following this, 
they were run one trial a day for eight 
days in a straightaway, with an appro- 
priate incentive. In the T-maze learn- 
ing situation eight of the hungry ani- 
mals found food in the white compart- 
ment at the right and eight found food 
in the black compartment at the left; 
eight of the thirsty animals found water - 
in the white compartment and eight in 
the black compartment. Assignment to 
these groups was random. The metal 
cups containing the food and the water 
were appropriate to the compartment, 
that is, the white cups were placed in 
the white compartment and the black 
cups in the black compartment. When 
the animals had tasted the food or the 
water they were lifted from the maze 
and placed in the appropriately lined 
individual food boxes to complete feed- 
ing or drinking. The metal cups here 
were again appropriate. 

For the extinction trials each group 
was divided in half. There were there- 
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B—black compartment. BC—black cup. 


WC—white cup. 
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TABLE I 


NuMBER or Runs REeguirep By Eacu S To REACH THE CRITERION OF EXTINCTION 
Method of Extinction 


S-WR D-WL 
trials trials 

D4 14 24 

E-1 14 18 

E3 3 Cc 14 

C3 14 33 


! 
i 
Hunger 


T 76 89 


Mean 19.00 22.25 


I- 20 31 
I- 15 16 
H- 14 25 
J- 
T 


~+=+DRIVE-<-~< 


! 
Thirst 


14 23 


63 "95 
23.75 


184 
23.00 


Mean 15.75 


GT 139 
Mean 17.375 


fore eight groups, which were labelled 
as follows: 


S-WR: i.e., same-white, right; these ani- 
mals, which had been fed in the white 
compartment, now found empty food 
cups in that same compartment. 

D-WL: i.e., different-white, left; these 
animals, which had been fed in the 
black compartment at the left, now 
found empty white cups in a white 
compartment. 

S-BL: i.e., same-black, left; these ani- 
mals, which had been fed in the black 
compartment at the left, now found 
empty black fvod cups in that same 
compartment. 

D-BR: i.e., different-black, right; these 
animals, which had been fed in the 
white compartment at the left, now 
found a black cup, empty, in a black 
compartment, 


The experimental procedure with the 
thirsty animals was similar, thus pro- 
viding the eight experimental groups 
indicated. The animals were placed in 
appropriate boxes containing empty food 


S-BL D-BR 

Ss trials Ss trials 

A-3 14 D-1 ax" 

B-3 19 E-4 14 
A-2 14 D-3 14 
B-2 ii C4 15 
58 65 

14.50 16.25 


288 
18.00 


23 14 
14 13 
14 14 
14 38 


65 79 
16.25 19.75 


123 144 
15.375 18.00 


302 
18.875 
590 


cups or empty water cups depending on 
the extinction group to which they were 
assigned. They were permitted to re- 
main there for an hour and were then 
given food or water in the home cage. 
Failure to make a choice at the choice- 
point after a ten-minute period was 
taken as an indication that the animal 
had attained the extinction criterion. 
The data for all eight groups (num- 
ber of runs required to reach the extinc- 
tion criterion) are presented in Table I. 
The statistical treatment applied to 
these data was an analysis of variance. 
Since this method assumes homogeneity 
of variance, it was first necessary to per- 
form the Welch-Nayer L-1 test to de- 
termine the justification for such an as- 
sumption. The resulting L-1 value was 
.640; since this is greater than the ta- 
bled five per cent value of .520 there is 
no reason to reject the assumption of 
homogeneity, and the method of analy- 
sis of variance is permissible. The es- 
serials of this analysis are presented in 
Tabie **. It will be immediately ob- 
vious that the variation associated with 
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Taste II 


ANALYSIS OF VARIANCE OF EXTINCTION 
TRIALS 





Sum of 


Source of variation squares 





6.125 
252.00 
1227.50 
50.25 


Between drives 
Between methods 
Within groups 
Residual 





Total 1535.875 














drive differences is inconsequential. No 
F-test is needed in this case to show that 
there were no differential secondary re- 
inforcing effects associated with differ- 
ence in drive, and the answer to the 
question posed at the beginning of this 
paper may be said to be negative. 

The test involving method variance is 
as follows: F = 84.00 ~ 51.146 = 1.642. 
This F, for the indicated degrees of 
freedom, is not significant at *he five per 
cent level. This was verified by per- 
forming an analysis of variance sepa- 
rately for the hunger and the thirst 
groups. The relevant data are pre- 
sented in Table III. Neither of the 
two F’s turns out to be significant. 

The results of this investigation must 
be correlated with the recent paper by 
Schoenfeld, Antonitis, and Bersh (19); 
they have made it quite clear that the 
cencept of secondary reinforcement and 
the conditions required for imparting 


TaBLeE III 


ANALYSIS OF VARIANCE OF EXTINCTION 
TRIALS FOR HUNGER AND THIRST 
Groups SEPARATELY 





Sum of df 


Source of variation Squares 





137.50 3 
582.50 | 12 
720.00 | 15 


Within groups 
Total 


Lee groups 


164.75 3 
645.00 | 12 
809.75 | 15 


T Between groups 
Within groups 
Total 

















WiitraM SEEMAN AND Nancy K. KJENaAas 


reinforcing properties to an initially 
neutral stimulus require a great deal of 
clarification. Their study deals with 
operant behavior, but there seems to 
be no reason why their failure to se- 
cure secondary reinforcement and their 
analysis of the complexity of the prob- 
lem should not bear some instructive 
lessons for maze behavior. The present 
investigation reveals no differential ei- 
fects associated with different drives; 
but one is not yet permitted to conclude 
that the nature of the drive is irrelevant. 
Indeed, in the words of Schoenfeld, 
Antonitis, and Bersh, “it raises the gen- 
eral problem of identifying and specify- 
ing the conditions necessary and suffi- 
cient for establishing a neutra! stimulus 
as a secondary reinforcer” (19, p. 40). 
We, however, would go further. We 
would raise a question about the condi- 
tions, that is, we would raise a question 
as to whether there might not be a re- 
lation between the set of conditions re- 
quired for imparting the reinforcement 
and the method selected for demonstrat- 
ing the reinforcement. It seems to us 
a gratuitous assumption to say that the 
necessary and sufficient conditions woulda 
be identical where the demonstration oc- 
curs in the acquisition of a response and 
in the extinction of one. 
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PSYCHOLOGY AND THE CONCEPT OF “LIFE” 


BY EZENDON SMITH 
The Pennsylvania State College 


Contemporary psychology is a natu- 
ralistic discipline. Its introductory texts 
are discreetly mechanistic; its advanced 
texts are bolder. Its journals look down 
their noses at animistic terminology, 
and its heavier thinkers hasten to as- 
sure us that there must be “neural cor- 
relates” for even their more elaborate 
constructs. 

Naturalism is a point of view which 
must be defended, and the orthodox bi- 
ologist has, of course, been defending it 
for decades. Nevertheless, there re- 
main distinguished heretics (3, 6,7). It 
is an interesting fact that the success or 
failure of the defense is not often a mat- 
ter of great importance to the average 
biologist. He works with more or less 
isolated tissues: a muscle, a kidney, a 
lung. Almost everyone will grant that 
each of these isolates is a mechanistic 
system, and no controversy arises. 
What everyone will mot admit is that 
there is no central vitalistic principle. 
The “mind” is the stronghold of vital- 
ism, and it is for the proprietors of the 
mind that the question of vitalism versus 
mechanism assumes the greatest signifi- 
cance. 

Presumably the ultimate defense of 
the mechanistic view lies in an explana- 
tion of life-processes in physicochemical 
terms. When the biochemist completes 
his job, we feel, we shall have an argu- 
ment difficult to answer. But this is 
strictly a long-term prospect, and the 
mechanist must now look to less impos- 
ing bulwarks. 

Among these attainable defenses we 
should expect to find an adequate defi- 
nition of the concept of “life.” The 
concept pretty clearly exists on a non- 
verbal level; there is little variability in 


the subject-matter of biology texts. Yet 
the verbalization of the concept has long 
eluded us. And so long as we cannot 
say what life is, even descriptively, so 
long will there be suspicion that we can 
never say, and that life can be defined 
only in terms of itself. The purpose of 
this brief note is to call attention to re- 
cent advances toward an adequate defi- 
nition of life, and to the possibility that 
the problem of definition has actually 
been solved. 

The new progress hinges largely on 
the biochemical concept of autocatalysis. 
The concept has become especially im- 
portant during the last decade or two, 
particularly in connection with the in- 
vestigation of the virus and the enzyme. 
As the word implies, autocatalysis is the 
process in which a substance acts to 
catalyse, from its environment, addi- 
tional quantities of the substance itself. 
Autocatalytic action is an obvious prop- 
erty of the virus, and on close examina- 
tion it appears indeed to be the sole 
claim to animation which that organism 
is able to advance. 

Here then is a criterion which seems 
to differentiate the borderline cases, and 
there are indications that it will serve 
more broadly as a basic indicium of life. 
One authority has recently said so ex- 
plicitly (5); another, Jaques Loeb, was 
essentially saying so many years ago 
(4); and the notion of autocatalysis 
is embodied in other sophisticated at- 
tempts to delineate the concept of life 
(1, 2). 

This general course of thought may 
perhaps be integrated in formal defini- 
tions. It seems reasonable now to say 
that “life” is a collective term for “liv- 
ing objects”; and a “living object” is 
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an object which will absorb substances 
from some environment and synthesize 
from them a replication of each of the 
substances which comprise that object. 
It will be noted that this formulation 
implies that the particular kinds of 
growth and reproduction often consid- 
ered as criteria of life are valid ones. 
At the same time, by specifying synthe- 
sis, it eliminates on one hand the bother- 
some instances of crystalline growth and 
osmotic swelling and on the other such 
cases of pseudo-reproduction as frag- 
mentation. By specifying absorption, it 
disallows the oft-cited claims of the 
burning flame, without, perhaps, in- 
fringing upon those of the virus. And 
lastly, by not insisting that living eb- 
jects do perform as indicated, but rataer 
only that they wil/, this formulation 
manages. to embrace the spores, seeds, 
and other forms of latexi life which 
have long been a source of difficulty. 
The ordinary definition is an arbi- 
trary thing, and it cannot be challenged. 
When we attempt to verbalize an exist- 
ing concept, however, there arise legiti- 
mate empirical questions of validity. 


The present formulation must be pre- 
pared to withstand such probing. If it 
does so successfully, it promises to pro- 
vide support for a naturalistic philoso- 
phy of psychology. The implications 
for pedagogy, thought, and research 
may then be considerable. 


REFERENCES 


1, Atexanver, J. Life; its nature and origin. 
New York: Reinhold Publishing Cor- 
poration, 1948. 

2. Gerarp, R. W. Unresting cells. New York 
and London: Harper and Brothers, 
1940... 

3. Liu, R. S. General biology and philoso- 
phy of organism. Chicago: University 
of Chicago Press, 1945. 

4. Lozs, J. The organism as a whole. New 
York and London: G. P. Putnam’s 
Sons, 1916. 

5. Prunxett, C. R. The primary physico- 
chemical basis of organic evolution. In 
J. Alexander (Ed.), Colloid chemistry. 
New York: Reinhold Publishing Cor- 
poration, 1944, Vol. 5. 

6. Scuroepincer, E. What is life? New 
York: The Macmillan Company, 1945. 

7. SHerrincton, C. Man on his nature. New 
York: The Macmillan Company, 1941. 


[MS. received October 3, 1950] 


} 











PERCEPTUAL DEVELOPMENT: SOME TENTATIVE 
HYPOTHESES * 


BY GARDNER MURPHY 
City College, New York 


AND 


JULIAN HOCHBERG 
Cornell University 


Such an abundance of new material is 
coming to hand from experimental psy- 
chology and clinical psychology regard- 
ing the dynamics of perception that we 
kave—as have others (6, 61)—been in- 
terested in attempting an integration 
utilizing both sources. The primary 
conception which we have exploited (at 
its root not so very different from John 
Dewey’s 1896 formulation) is that per- 
ception is a form of continuous adjust- 
ment to environmental requirements, in- 
volving not simply a “sensorium” or seat 
of cognitive functions, but the whole 
organism. But from this basic idea, em- 
pirical facts and personal idiosyncrasies 
may lead speculation in any one of 
many directions; and the one which has 
especially interested us is the concep- 
tion of perceptual learning, the concep- 
tion that the whole organism, in coping 
with its environment, progressively al- 
ters its modes of perception and de- 
velops more and more complex ones 
that serve it better. What we are try- 
ing to say will be presented under a se- 
ries of eighteen hypotheses. It would 
be good if these hypotheses could be 
prepared in the form of postulates, but 
they are neither as clearly thought 
through nor as coherent as are the usual 
postulates of science, and on the other 


1 The main outlines of this paper were pre- 
sented by the senior author on June 7, 1949, 
to the British Psychological Society in Lon- 
don. The present form of the paper repre- 
sents both authors’ views equally, not by com- 
promise but by coalescence. 


hand some of them take the form of 
propositions regarding which a good 
deal of empirical material has been as- 
sembled. They are presented here, how- 
ever, neither as substantiated principles 
nor as original and unfounded theses, 
but rather as a very tentative system of 
ideas to be worn “like a loose garment,” 
a makeshift which may be replaced by 
more appropriate garments when the 
facts are clearer and their interrelations 
more evident. 


Tue HyportrHeEseEs 


(1) Perception is typically, if not al- 
ways, both unitary and structured—a 
unitas multiplex. The tree-frog’s piping 
or the roll of drums gives tonal ele- 
ments, but elements in temporal order 
and with a quality which reflects the 
whole context of the hearing. This first 
hypothesis involves the recognition of 
the difference between an assemblage of 
parts and an organized structure, and 
also the conception that, in this whole, 
differentiable aspects can still be ob- 
served. There are phenomenal aspects 
which can be noted separately although 
they are held in a sort of central coher- 
ence: such “parts” (objects, extended 
forms, etc.) display considerable sta- 
bility and separable identity insofar as 
they are of defined extent, and can with 
relative independence be shuffled and 
manipulated; thus, a leaf or a robin in 
the visual field may change its spatial 
location, size, orientation, and may even 


332 





PERCEPTUAL DEVELOPMENT 


be partially occluded by other objects 
and still maintain its individual identity 
and form (23). However (as is at- 
tested by their constancy in the face of 
changing stimulation), the existence and 
the characteristics of such spatially de- 
limited “parts” cannot be ascribed to 
the action of any comparably separable 
or spatially delimited aspects of the pat- 
tern of stimulus excitation (i.e., per- 
ceived size is not a univocal function of 
visual angle, or retinal shape, or retinal 
stimulus intensity, etc.). Instead, the 
characteristics of any phenomenal “part” 
depend upon relationships within the 
total extended field of impinging stimu- 
lation, and upon many diverse and, fre- 
quently, mutually reinforcing (8) as- 
pects of the stimulus distribution.” 

(2) A primordial aspect of the struc- 
ture of the percept is differentiation be- 
tween figure and ground. Here are em- 
phasized facts such as the following: 
that the phenomenal content of visual 
perception consists of shaped regions 


and relatively urshaped ground, and 
not of immutable fixed elements such 
as mosaic points; and that accent, or 


anchorage, is always well marked. 
There may be a moment of instability, 
when wood-thrush song is melted into 
“forest murmurings,” but figure-ground 
relations are quickly established, and 
any changes in organization must either 
occur within the relationships first laid 
down or pass through only transitory 
and limited periods of ambiguous dormi- 
nance. This figure-ground differentia- 
tion exhibits greater or lesser degrees of 
sharpness (62). It is not a single bi- 
polar process, with one figure stably su- 
perposed upon one ground. Rather, it 


2 Thus, the perceived brightness of a par- 
ticular region depends upon the distribution of 
stimulation in other regions (58); upon pre- 
ceding conditions (18) and “memory” (12) ; 
upon stimulation in other modalities (52) ; 
upon the extremely complex relationships of 
contours, proprioception, brightness itself, etc. 
which determine perceived space (20), etc. 
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is capable of extreme complication (23) 
—as the listener at Parsifal finds it 
hard t\ “take it all in” ially as 
we leave the static fixation of the labo- 
ratory for the free glance and the mov- 
ing observer of more natural perceptual 
conditions (56). 

(3) Structure depends genetically 
upon two processes: (a) differentiation 
within a more-or-less homogeneous ma- 
trix, and (b) integrations of the differ- 
entiated phases. Here reference is made 
to Herbert Spencer’s conception of prog- 
ress from homogeneous to heterogene- 
ous; likewise, to the evidence of Werner, 
Piaget, and others that diffuseness (and 
even a “syncretic” lack of differentia- 
tion between self and environment) is 
characteristic of the child’s earliest per- 
cepts, and that the development of parts 
can actually be observed as the child 
takes hold of, points to, and names spe- 
cific aspects, while other aspects of the 
environment remain “blurred” or com- 
pletely undifferentiated. The elephant 
may stand out in the child’s apprehen- 
sion of the circus—just as the elephant 
in the circus poster stands out against 
the painted jungle behind. 

Integration of the differentiated phases 
comes only as a result of differentiation 
and only as interaction occurs between 
the differentiated phases. Thus, in per- 
ceiving words as compounds of letters, it 
is at first necessary to differentiate the 
letters (which are not, in general, suffi- 
ciently “good” forms to insure such dis- 
crimination on autochthonous grounds 
alone), and then to integrate those 
letters into larger, characteristic word- 
units (the modern teacher may teach 
words before letters; this means simply 
postponing, until the child is ready, the 
analytical learning of letters). 

The same processes of perceptual de- 
velopment can be observed in the adult 
by presenting new “micro-scales” (60), 
within the musical intervals now in use, 
in which the initial indiscriminability 
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of the “notes” gives way to ready per- 
ception of tones which still later inte- 
grate to form “micro-melodies” (it is 
probably in such culturally unstressed 
areas, as tactile and olfactory percep- 
tion, that the most fruitful investiga- 
tions of perceptual development will be 
made); children presumably acquire 
our normal musical scales in like 
fashion. 

The assumption of integration as a 
_ process which may at least tentatively 

be considered as separable from differ- 
entiation is supported by the inade- 
quate integration of already-differenti- 
ated parts (evidenced dy relatively in- 
dependent spatial lability) found in 
children (60); by those cases of visual 
agnosia (44) in which the parts of 
complex presentations are perceived but 
not the interrelations between them; by 
the drawings of primitives in which 
“.. . particulars often appear as self- 
subsistent things which do not neces- 
sarily become synthesized into larger 
entities” (60, p. 139). 

Progression from (a) homogeneous 
through () differentiated to (c) inte- 
grated characterizes, in principle, not 
only the stages through which the per- 
ceptual development of the maturing 
child passes, but also (with important 
qualifications later discussed) the course 
of every perceptual process, regardless 
of age. (a) We hear Japanese spoken 
—a melodious blur; (5) we learn a 
few words and pick them out as we 
listen; (c) we follow a conversation. 
The initial response even in the adult 
is diffuse and undifferentiated. It may 
be incompletely separated from the 
“ego,” unlocalized in space and time, 
undifferentiated as to form and mo- 
dality; and it may contain strong af- 
fective and conative as well as cogni- 
tive components (5, 9). Although the 
course of such development is, with fa- 
miliar stimulus configurations, altered 
by prior differentiations and integra- 
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tions, and although the durations of 
these stages are in most cases much 
shorter in the adult than in the child 
(perhaps simply because of his greater 
experience, and quite aside from the 
matter of neural maturation), the basic 
nature of the process is assumed to be 
the same.® 

It is not implied here that the stimulus 
situation automatically determines the 
development of any percept, since there 
are usually many ways in which boun- 
daries can be cut across any inhomo- 
geneous field, and a large number of 
ways in which such boundaries can serve 
as figural contours. Admittedly, in 
some limiting cases the autochthonous 
forces of organization restrict perceptual 
development to configurations of high- 
est stability (23, pp. 106-210); with in- 
creased stimulus complexity, however, 
the number of almost equally “good” 
organizations may increase, i.e., in terms 
of stimulus considerations alone the 
situation may become ambiguous and 
any number of organizations equally 
possible. 

(4) The phenomenal aspects of the 
resulting integration are interdependent 
and exhibit membership character in the 
whole. The fear in a child’s voice is 
fully apprehended only in a context of 
perceiving the child and his situation. 
Whatever ultimate independence may 
characterize receptor processes or af- 
ferent conduction, the percept as we 
actually know it exhibits interdepend- 
ence of phenomenal phases, and no 
phase can be brought into relation with 
a punctiform stimulation whether of the 
periphery or of the cortex.* 


8It is not here implied that tachistoscopic 
exposure achieves a corresponding fractiona- 
tion of percept development. Reduced ex- 
posure time probably does little more than re- 
duce the efficacy of the stimulus. 

*It may be possible to recognize a relative 
independence of sensory-neurone processes pre- 
ceding their convergence at central levels; 
however, evidence such as that of retinal field- 
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Assuming interrelatedness of the 
physiological processes underlying the 
interrelated psychological ones, and re- 
calling that even the most spatially de- 
limited perceptual “part” depends for 
its existence and characteristics upon 
the total extended field of stimulation, 
we should expect that the integration of 
delimited “parts” involves likewise the 
integration or mutual adjustment of the 
extended processes upon which they de- 
pend. Thus, the whole field of physio- 
logical processes is altered in perceptual 
integration, and each integrated part is 
conditioned by—is “keyel” to—the 
whole field (cf. 23, p. 554) 

(5) Exteroceptive, interoceptive, and 
proprioceptive components enter fully 
and on an equal footing, but not always 
with equal weight, into the dynamics of 
perception. Those sense organs which 
make contact with the outer world, 
those which make contact with the vital 
organs, and those which make contact 
with our own muscular adjustments to 
our environment pool their sensory con- 
tributions. Rejecting the view that per- 
ception is necessarily heavily loaded 
with exteroceptive components and that 
it therefore “mediates reality,” it is here 
urged that it is simply the prejudice of 
psychologists to emphasize the distance 
receptors to an extraordinary degree 
and, therefore, to build our whole 
theory of perception upon a somewhat 
warped notion of how the organism 
works. The spatial world in which we 


effects paralleling the phenomer .] field-rela- 
tions (39), etc., makes doubtful isolation even 
at peripheral levels. Be that as it may, we 
can here admit of no trace of punctiform sen- 


sation in perception. Although the belief 
probably still exists that it is possible under 
special attitudes to inspect one’s sensations, 
such attempts generally reveal only some re- 
duction of phenomenal constancy, explicable 
as a perceptual reorganization which rotates 
the region considered into a plane frontal- 
parallel to the observer, thus yielding percepts 
similar in some ways to the physical image on 
the retina. 
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live is not constituted by exteroception 
alone. In the very grossest sense, the 
proprioceptive pattern provides an im- 
portant base to perception (anatomi- 
cally, the sensory and motor functions 
of the nervous systems are extremely 
intimate, if not inextricably interrelated 
[13]). We observe actively, muscu- 
larly, with head, eyes, neck, and much 
else besides. Visual information must 
be pooled with proprioceptive informa- 
tion as to the position and motion of the 
receptor surfaces if effective contact 
with the outside world is to be main- 
tained, and it is the excellent agreement 
of the percepts based upon these two 
factors with the world as we know it 
from other grounds which leads to the 
frequent neglect of the obvious inade- 
quacy oi é¢ither, and to the acceptance 
of a largely mythical version of visual 
exteroception as the basis of perception. 
The term “mythical” is used because 
the information yielded by any mo- 
mentary ‘fixation (devoid of scanning, 
of motion parallax, and clearly articu- 
lated only in the foveal region) bears 
little resemblance to the “visual” per- 
cepts upon which we tend to base not 
only our normal activities but (in con- 
sidering the “stimuli” to which an or- 
ganism responds) our scientific analy- 
ses as well. It may be possible, of 
course, to restrict the term “percep- 
tion” to that process which occurs dur- 
ing such momentary fixations (although 
this raises the question as to where 
such fractionation should cease), but 
this course makes most of what we 
normally call perception the province 
of “judgment,” “inference,” “memory,” 
etc. 

Although perception is based, in part, 
upon both proprioceptive and extero- 
ceptive “information,” both components 
are in large part “silent” in the forma- 
tion of the visual percept. We are no 
more directly aware of the continually 
fleeting and changing transformations 
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and patterns of retinal stimulation than 
we are of the proprioceptive stimulation 
occasioned by physiological nystagmus, 
by the acceleration of gravity, and by 
the relative positions of eye, head, and 
trunk. It is only in special cases that 
the influence of one or the other be- 
comes evident, as in situations where 
either component is manipulated in an 
“artificial” manner (i.e., without re- 
ciprocal or corresponding change of the 
other) or in situations in which careful 
analysis of the psychophysical relation- 
ships reveals systematic discrepancies 
from what would be expected by extero- 
ceptive components alone. Numerous 
studies dealing with the effects of the 
distribution of muscular tension on the 
perception of visual stimuli, and of ex- 
teroceptive stimulation on the percep- 
tion of muscular tension, are summa- 
rized by Werner and Wapner (61), 
who conclude that perception is the 
product of a fusion of dynamically 
equivalent sensory and tonic factors— 
a viewpoint close to the one here ac- 
cepted. 

It is here urged that the same ap- 
proach can be applied to the intero- 
ceptive excitation-patterns which yield 
“information” as to the state of our in- 
ternal organs and our tissue needs. Un- 
less we postulate an isolation between 
the exteroceptive and proprioceptive 
portions of the nervous system, on the 
' one hand, and the interoceptive tracts, 
| on the other, we must expect some in- 
terrelationship. Such isolation appears 
improbable (22); and even if anatomi- 
cal isolation were complete, relation- 
ship would be expected through pro- 
prioceptive changes due to autonomic 
activity. It is therefore assumed that 
our “visual” percepts are based upon 
information delivered by all afferent 
tracts, whether they be exteroceptive, 
proprioceptive, or interoceptive in 
origin. Moreover, any effector excita- 
tion pattern aroused by those cortical 


excitations correlated with a percept 
will, through consequent muscular and 
visceral effects, feed back through ex- 
teroceptive, proprioceptive, and intero- 
ceptive tracts to modify the perceptual 
organization of the pr:ceding moment. 
Thus, our conceptions of the external 
world are in part a product of what 
goes on inside us, and our conceptions 
of what goes on inside us are to some 
degree loaded by our responses to the 
external world. 

Consonant with the view of perceptual 
development, it is assumed that in the 
most primitive forms of perception all 
three sources of sensory information are 
present in an undifferentiated or par- 
tially differentiated state and that subse- 
quent differentiation gradually achieves 
more and more separation of these fac- 
tors. We would expect that in primi- 
tive stages, perception of environmental 
objects would prove inextricably fused 
with the motion, position, and feeling 
state of the observer. Considerable evi- 
dence that the behavior of very young 
children, of the drug-intoxicated, and 
of brain-damaged patients can best be 
understood as the effects of such un- 
differentiated fusion of sensorial, affec- 
tive, and motor phenomena has been 
collected and compellingly presented by 
Werner (60). The hammer is not per- 
ceived as a visual object only; it is di- 
rectly perceived as heavy, and as hard, 
and as “a thing you strike with.” In 
children before such areas are well dif- 
ferentiated, in cases of brain injury in- 
volving considerable dedifferentiation, 
and in schizophrenics in whom ego- 
object polarity is lost, it is difficult or 
impossible to get reactions in terms 
of such highly static, differentiated 
abstractions as objects-in-isolation or 
shapes-as-such. The adult separation 
between subject and object may be 
viewed as being as much a product of 
differentiation as is the distinction be- 
tween modalities, which is likewise de- 
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creased in children, in patients suffer- 
ing from drug-intoxication, etc., and is 
not always perfect even in normal adults 
(52). Even in the sophisticated adult, 
such fusion of contributions from all 
sources provides the basis of percep- 
tion: the initial stages of perception 
are, like the earliest responses of child- 
hood, a fusion of external and internal 
information. Undifferentiated affective, 
conative “pre-sensations” not specifi- 
cally localized in space or time are not 
completely differentiated as to modality 
(5). 

After differentiation is completed, the 
evidence of interaction is more difficult 
to come by. The very structure of our 
language is designed to make almost im- 
possible a report on process or context 
rather than static and abstract identity 
(24), and in the laboratory an added 
effort has been made to rule out the 
“subjective” variable presumably so re- 
moved from science. Even in the labo- 
ratory, however, one no more normally 
perceives objects completely outside of 
any affective and conative setting than 
one perceives them without spatial char- 
acteristics. Such affective character- 
istics may be attributed to the per- 
ceived object, but they probably tend 
to shift, with development, towards a 
subjective tension or need attributed to 
the observer’s perceived self, in line 
with the general developmental tendency 
towards stabilization of environmental 
objects and abstraction from their con- 
texi.® 

5 Similar relationships appear in the coop- 
eration of exteroception and proprioception, 
¢.g., in the way in which motion, if perceived 
in the object, may not at the same time be 
ascribed to the self and vice versa (57); in 
the same fashion, the affective components 
perceived as characteristics of the object may 
not at the same time be perceived simply 
as information concerning subject’s needs—a 
stage of incomplete differentiation very similar 
to phenomena subsumed under the concept of 
“projection.” 
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It is suggested, then, that as the pro- 
prioceptive components may contribute 
“silently” to our perception of spatially 
localized objects, the information from 
the vital organs may in most cases also 
be silent, yet contribute heavily, to- 
gether with proprioception and extero- 
ception, to those relationships (between 
the perceived self and perceptual ob- 
jects) which may be subsumed under 
Tolman’s concept of “sign-gestalten” 
(55), those action-impelling attributes 
of the perceived world which, in order 
to achieve “objectivity,” we so rigor- 
ously train ourselves to ignore. After 
dinner, not before, is the time to present 
our case. Much of what we term “af- 
fection” would then consist (at least 
originally) in the contribution of intero- 
ception to the exteroceptive-propriocep- 
tive pool; similarly, much of what we 
term “conation” depends upon the 
proprioceptive information contributed. 
There is, however, no warrant for identi- 
fying cognition solely with exteroceptive 
processes, e.g., with visual, tactual, au- 
ditory sensations, and excluding organic 
sensations; and there is no warrant for 
relating affection solely to interoception, 
nor for relating; conation solely to pro- 
prioception. 

(6) Jt is meaningless to separate 
sharply the cognitive, affective, and 
conative aspects of perception. A proc- 
ess must be cognitive if it is to be called 
perception, but it may be affective and 
conative as well. In the normal in- 
tact orgarism retaining interoceptive 
and proprioceptive neural components, 
it would not only be difficult to get the 
exteroceptors to work at the same time 
that one prevented the interoceptors and 
proprioceptors from working; it would 
also be improbable that the resulting 
process would be anything like percep- 
tion as we know it. Indeed, the terms 
“cognition,” “affection,” and “conation” 
become matters for phenomenological 
investigation rather than psychological 
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categories. The fact that a particular 
phenomenal region within the field car- 
ries the “flavor” of objectivity and 
cognition, and another region carries 
those of affection or conation, does not 
necessitate ascribing to them separable 
sensory origins. We cannot identify 
cognition with exteroception, affection 
with interoception, etc., since we have 
seen that cognition itself includes neces- 
sary interoceptive and proprioceptive 
elements, and we cannot talk of percep- 
tion as modified by conative or affective 
factors, any more than we can talk of 
perception as modified by kinesthesis. 
Cognition, affection, and conation are 
therefore independent only in the sense 
of a necessary and sometimes rather 
laborious scientific task of abstraction. 
Perception as such can (and, moreover, 
must) be loaded or weighted with any 
one of these three processes and still 
remain perception. 

This interdependence may perhaps 
also throw light on the problem of mo- 
tive or “drive.” Interoceptive proc- 
esses, whether we are aware of them 
or not, are of course important aspects 
of many drives, e.g., hunger. Hunger, 
while physiologically a response to tissue 
needs, may at the same time be just 
as sensory as is a patch of color. Pro- 
prioceptive processes, such as restless- 
ness, or the impulse to change from one 
posture to another, likewise have a 
drive quality. Frequently the drive to- 
wards a goal-satisfying object, like apple 
pie, involves both interoceptive and pro- 
| prioceptive (goal-seeking) components. 
Impulses from such interoceptive and 
proprioceptive sources may fuse freely 
with exteroceptive sources, and as the 
apple pie in the kitchen is smelled or 
seen, we do not merely perceive it; we 
perceive it hungrily. Hunger does not 
necessarily becloud the cognitive situa- 
tion, for the flaky crust, the spicy 
aroma, are more vividly apprehended 
than when we are neutral toward such 
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things. From such a homely example 
of the fusion of exteroceptive, interocep- 
tive, proprioceptive components we may 
proceed by gradations to Robert Burns’s 
“for to see her was to love her” and to 
Spinoza’s “intellectual love of God.” 

To become aware of our drives, in 
contradistinction to becoming aware of. 
the things we want, is usually a form 
of perception in which the interocep- 
tive processes receive emphasis; but all 
drives, insofar as we are aware of them 
at all, become, from this viewpoint, per- 
ceptual phenomena. There is no need 
for two independent categories in our 
basic psychology of immediate experi- 
ence, one for perception, one for drive. 
Both perception and drive are cognitive; 
both are affective; both are conative. 
It is just a question of convenient scien- 
tific abstraction and emphasis. 

It may be asked whether, as we list- 
lessly loll on the riverbank and eye the 
clouds, we perceive in the same affec- 
tive and conative fashion. In reply, we 
suppose that if we became so perfectly 
neutral to the summer’s day as to be 
free of any desire to drink it lusciously 
in, the interoceptive and proprioceptive 
components which we call drowsiness 
would dominate the situation. We 
should conclude by perceiving it neither 
conatively nor, indeed, in any other 
way than by going to sleep. 

(7) Perception develops through ex- 
perience as well as through neural ma- 
turation. It was suggested earlier that 
there is systematic perceptual “growth” 
dependent upon a general developmen- 
tal principle of differentiation and inte- 
gration. It seems likely, however, that 
experience also results in this kind of 
development, and that growth may be 
to some extent subsumable under dif- 
ferentiation and integration; i.e., the 
high degree of perceptual structure char- 
acteristic of the adult may be a direct 
consequence of the child’s successful 
passage through the developmental se- 
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quence of differentiation and integra- 
tion in a large variety of stimulus con- 
ditions. The members of different cul- 
tural groups have learned to perceive as 
culture requires. Certainly the present 
evidence (48) as to the roles of use and 
experience would make it unwise to set 
limits to the relative importance of these 
factors in determining the perceptual 
repertory of the organism. On this as- 
sumption, individual differences in ‘ex- 
perience, in interaction with the envi- 
ronment, will alter the rate and form of 
perceptual development. Environments 
differ in the direction and extent to 
which they force us to make distinctions 
and then to integrate what has been 
distinguished (it was for this reason that 
it was previously implied that experi- 
ments with adult human beings on per- 
cept development would not yield sim- 
ple developmental laws uncontaminated 
by the effects of past experience [15]). 

Cognition in a given situation is a 
function not only of the pattern of 
stimulation, but of the “schema,” frame- 
work, or matrix within which the cog- 
nitive contact is made, as has been 
demonstrated by the work of Sherif 
(54) and Bartlett (4), and by the many 
experiments in perceptual “set” (10, 
29, 45). If we assume that the matrix 
within which each percept develops may 
itself be so altered through experience 
that it guides in an altered fashion the 
direction and extent of the growth of 
the percept, then, although the initial 
developmental phase of each percept in 
the adult may be as undifferentiated as 
that of the child, subsequent differentia- 
tion will occur in directions previously 
found successful or satisfying if such 
growth is possible within the limits 
set by the stimulus distribution. If 
the stimulus distribution “blocks” the 
growth of the percept at any stage, 
either the development will have to take 
a new course within the framework or, 
if no successful resolution is possible, 
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an altered or new matrix (“cued,” per- 
haps, by the preceding perceptual de- 
velopment) will be brought to bear. 
The successive mutual adjustment of 
matrix, stimulus distribution, and per- 
cept in the attempt to mesh into a co- 
herent, stable organization may appear, 
at least superficially, like trial-and-error 
behavior. However, where the matrix 
and the limits imposed by the stimulus 
coincide; where stimulus distribution is 
strong and simple, allowing of only one 
organization of overwhelmingly superior 
stability; or where the matrix is rela- 
tively strong and the stimulus distribu- 
tion weak or complex and permissive 
of many alternative organizations, there 
should be no “trial-and-error” but, 
rather, a smooth perceptual develop- 
ment. 

Moreover, the point at which further 
articulation ceases should depend not 
only upon what, in the abstract, consti- 
tutes a configuration of maximum au- 
tochthonous stability but also upon the 
demands of the situation and upon the 
extent to which differentiation has pro- 
gressed in the past. The level at which 
the tourist admires masses of foliage in- 
volves far less articulation than that 
which the native, searching for camou- 
flaged game, must employ; on the other 
hand, identifying simple geometrical 
forms may prove a more difficult task 
for the jungle native than for the tour- 
ist, to whom such “signals” are a mat- 
ter of daily importance (41). 

(8) It is possible to control experi- 
mentally the phenomenal aspects of the 
genetic process, i.e., to teach the sub- 
ject how to perceive. Although it is 
granted that the sensory neurones feed 
specific stimulus-determined impulses to 
the brain, and that the basic “autoch- 
thonous” laws of organization described 
by Gestalt psychology impose further 
conditions upon what can be perceived, 
repeated experience with particular at- 
tributes of the environment builds up, 
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first, different types of differentiation 
and then different types of coherence 
between the differentiated “parts.” 

Consider the vast difference in the 
perception of a complicated electronic 
gadget by one who has built it and by 
someone else who, with knowledge 
neither of the gadget nor of electronics, 
reports a mass of colored spaghetti with 
anchorage points perhaps determined by 
brightness of color, the purely “autoch- 
thonous” simplicity of form, etc. Learn- 
ing to read must certainly impose new 
grouping upon the perception of ma- 
terial printed in the appropriate alpha- 
bet; indeed, the individual letters are 
apparently now perceived more readily 
under reduced stimulus conditions than 
are their mirror images (19), which are 
presumably equivalent configurationally. 
Insofar as individuals have learned to 
make distinctions and to interrelate 
those distinctions in ways which serve 
their needs, they must perceive in a 
manner unlike those with different ex- 
periences and with different needs. 

Of special interest are examples of 
“interference” with the factors which 
determine the figure-ground differentia- 
tion: Schafer and Murphy (53) re- 
warded their subjects along with the 
perception of certain patterns, punished 
them with others; later, with both 
stimuli presented simultaneously, figure 
and ground consistently followed the 
direction of reward and punishment. 
Subjecis failed to see in these combined 
presentations the materials which had 
been seen just before they were pun- 
ished, and saw in the presentation what 
they had seen earlier, just before they 
were rewarded. Apparently, they liter- 
ally learned to perceive one way rather 
than another.* Proshansky and Murphy, 


® Recent repetition by I. Rock and F. S. 
Fleck (49) has failed to duplicate the results 
of this experiment. It is difficult to accept 
their conditions as comparable inasmuch as 
the design of the experiment was somewhat 
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(47) by giving arbitrary rewards after 
the presentation of long lines and pun- 
ishment after short ones, caused in- 
creased estimates of line length; by 
similar procedure, they also produced a 
tendency to underestimate lifted weights. 
Another interesting example can be 
found in an experiment by Krechevsky 
(26) in which the autochthonous per- 
ceptual “law of proximity” was found 
operative in the rat only when reward- 
achievement was helped by such opera- 
tion. The process of learning by re- 
ward and punishment appears, then, to 
apply to perceptual development—per- 
haps to the same extent and subject to 
the same limitations that appear in any 
generalization about learning. 

(9) Repeated stimulation alone, with- 
out either known satisfaction or known 
frustration, also leads directly to analy- 
sis and to synthesis, i.e., differentiation 
and integration. The same passage 
from homogeneity to a differentiated, 
and from a differentiated to an inte- 
grated, level is likely to be obtained by 
sheer repetition without involving neural 
growth in the usual maturational sense 
and without involving satisfaction or 
frustration of any known nature.’ Here 
we can appeal to everyday experience or~ 
the stabilization of perception; to Rub- 
in’s demonstration (51) that familiarity 
can determine which of two alternate 
figures will be seen; to a follow-up 
study by Schafer (53, p. 341), in which 
this kind of stabilization of perception 
occurred without external rewards and: 
punishment; to Leeper’s stucy on the 
organization of incomplete figures (32) ; 
to Hanawalt’s findings (16) that in- 


altered. Further work is certainly needed. 
(It seems likely, also, that the motivation af- 
forded by rewards of four cents has probably 
changed both qualitatively and quantitatively 
since 1943.) 

* Hebb (17) has recently proposed a neuro- 
logical model involving something very much 
like differentiation and integration through 
sheer repetition. 


' 
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creased recognition of concealed figures, 
as a result of practice, was at least in 
part dependent upon the “unessential” 
parts of the concealing design becom- 
ing ground; and to a striking example 
of the effects of sheer experience in the 
recently reported work of Wallach and 
O’Connell (59) showing that perceiving 
the projection of a wire figure on a flat 
surface as the projection of a three- 
dimensional structure depended upon 
specific past experience with the pro- 
jection of that form while in rotation. 

It would seem reasonable, here, to as- 
sume the operation of something very 
much like Woodworth’s “drive to per- 
ceive” (66), i.e., that even in the ab- 
sence of external pressures, the integra- 
tion of the main “anchorage points” 
into a coherent, stable unit is an end in 
itself, an assumption in close accord 
with the Gestalt concepts. “Sheer repe- 
tition” might then serve the function of 
providing opportunity for the operation 
of this process.® 


(10) Integration tends to achieve 
homeostasis or stability and offers maxi- 


mal resistance to change. Integration 
involves a decrease in the freedom of 
each differentiated part, with mutual 
support and reinforcement of such parts 
by each other and by the whole (23). 
This reduction of freedom may tenta- 
tively be assumed as due, at least in 
part, to bringing the configuration of 
cohesive field forces (43) to the stable 
state of lowest energy (3); that is, once 
integrated, the percept would resist 
change. (It would be interesting to 
explore the problem of whether the 
achievement of stability is related to 
rigidity as it is known to psychiatrists 

8 Perhaps perception passes through sub- 
stages of relatively stable organization, depend- 
ing either upon temporal processes (as are 
ambiguous figures [25]) or upon repetition 
under slightly altered circumstances, to be 
“jogged” out of the substages and to achieve 
maximal integration. 
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and clinical psychologists. Perhaps the 
contemporary study of feed-back sys- 
tems will ultimately show that the main- 
tenance of stability in a perceptual sys- 
tem is formally analogous both to the 
physiological maintenance of homeo- 
static balance (27) and to psychopathic 
rigidity.) 

(11) “Association by contiguity” re- 
sults directly from such synthesis. 
Something phenotypically like “associa- 
tion by c~ntiguity” must occur if inte- 
gration is a direct result of repeated 
or extended presentations of a stimulus 
distribution. The integration into a 
stable percept of the various simul- 
taneously present differentiated “parts” 
must change the components so that 
they now bear membership character in 
the same organization (38), and each 
one, no longer independent, “leads to” 
the others through their mutual rela- 
tionships with the whole. This, of 
course, is not association by contiguity 
in the usual sense inasmuch as the 
“elements” do not maintain their 
identity and inasmuch as “asscciation” 
is not by addition of elements but by 
the modification of each part by its 
role in the original percept. In a sense, 
the “part” now bears the “keyed” im- 
print of the distribution of stresses in 
the original percept, and separate pres- 
entation of that part therefore tends to 
re-establish something which may be 
close to the percept aroused under the 
original conditions of stimulation. Thus, 
with the perception of such a “keyed” 
part the development of the rest of the 
field is guided along those paths (where 
permitted by the stimulus material) of 
differentiation and integration traversed 
by the original percept. This “leading- 
to” based upon previous structure (21) 
would constitute at least a large part of 
the matrix within which perception de- 
velops, although we should expect not 
the ideal operation of such tendencies 
under stationary stimulus distributions 














but, rather, continually changing, tempo- 
rally-extended interrelations of partially 
aroused and incomplete frameworks, 
shifting stimulus distributions, and the 
feed-back from the developing percept 
to both. 

Matrices may affect not only percep- 
tual selection but reintegration, or clos- 
ure, as well: an evoked matrix directs 
the development of the percept toward 
recreation of the original integration; 
and where the stimulus situation is 
close to that which evoked the origi- 
nal percept, this must result in what 
appears to be a “lowered threshold.” 
Even poorly defined or incomplete 
fragments of previously well-integrated 
wholes should lead to the reintegration 
of something close to that whole. Thus, 
with impoverished and even fragmen- 
tary stimulation, we would expect lower 
thresholds for reporting familiar forms 
than tor configurationally equivalent un- 
familiar forms. The data reported by 
Henle (19), who found lower thresh- 
holds for letters than for their mirror 
images; by Duncker (12), who found 
evidence of a “memory color” evoked 
by shape, etc.; by Postman and Bruner 
(45), who found closure of incomplete 
circles attributable to an experimentally 
induced “set” to see circles; and, pos- 
sibly, by Postman, Bruner, and McGin- 
nies (46), who found lower tachisto- 
scopic thresholds for words most con- 
sonant with “personal values,” may be 
called upon to support this hypothesis. 
Such effects presumably require that 
the stimulus distributions permit such 
organization. Thus, Duncker found 
that memory color may increase a given 
hue, not create one completely absent 
from the stimulus; and Djang (11) 
found that past experience with forms 
affects their perception when embedded 
in other forms only if the autochtho- 
nous organization of the latter is not 
determinate, permitting alternate or- 
ganization, etc. 
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We would expect that such re-consti- 
tuted aspects of the original percept 
might have effects as though actually 
sensorially present (28). Perhaps the 
“conditioned response” may come under 
this principle, in the sense that from 
appropriate excitations there arise be- 
haviors appropriate to a stimulus (or 
to expectation [55, 66] of a stimulus) 
which is not sensorially present (14) 
but which is part of the integrated, 
temporally extended world-of-action 
with which the organism is faced.° 

Guidance by such matrices in no way 
guarantees the validity of the percepts, 
or of the behavior thereby aroused: the 
completion of autochthonously strong 
figures, or of familiar ones (7), may 
very well involve erroneous perception 
of the environment, and behavior di- 
rected toward a strongly expected event 
may prove unrealistic. Far away, we 
recognize a friend by his walk; but on 
closer view the walk attaches itself to 
an alien form—and turns out to be not 
quite the walk we knew. 

(12) The processes of autistic percep- 
tion are only relatively distinct from 
realistic or objective perception insofar 
as the phenomenal content of a percept 
is concerned. The extent to which such 
perceptual development deviates from 
what would be expected from the envi- 
ronment, as we know it on other 
grounds, constitutes “perceptual distor- 
tion.” Where the percept is “distorted” 
toward an environment more consonant 
with the observer’s “needs,” such distor- 
tion is called autism.° However, all 


*It is probable that, as Koffka argued (23), 
any hypothesis of this nature must assume the 
existence of memory traces; however, it does 
not seem essential at this stage to make explicit 
assumptions as to their characteristics. 

10 “Needs,” especially in human adults, may 
often be less directly related to metabolic 
deprivations than to the highly complex “ego- 
tensions” concerning which we can learn much 
more from the personologist than from the 
physiologist ; ¢.g., the ego-mechanism may in- 
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perception involves adjustment between 
matrix and stimulus distribution; 
whether or not such adjustment “re- 
constitutes” the environment, the proc- 
ess remains fundamentally the same. 
It is, of course, perfectly possible to 
separate autistic and realistic proc- 
esses if one refers to the degree of suc- 
cess in “adapting” to the external en- 
vironment, or to performing certain 
tasks according to certain criteria of 
the experimenter. It is evident, how- 
ever, that if we do this, we are em- 
phasizing external criteria of success or 
failure, not actually describing a per- 
ceptual process. Indeed, if the environ- 
ment changes somewhat while retaining 
certain identifiable features, the same 
perceptual response may be “realistic” 
today and “autistic” tomorrow. The 
man who sees at a lower tachistoscopic 
threshold those words which fit his 
“value system” (46) is only displaying 
autism if the stimulus distributions 
around which his percepts developed 
actually constitute other words, but the 
perceptual process remains the same in 
either case. Realism and autism are 
functional terms, not terms which relate 
to the content of experience. 

(13) By varying experimentally the 
exteroceptive, interoceptive, and pro- 
prioceptive components which affect per- 
ception, one may define a full continuum 
from realism to autism. The experi- 
menter may reduce one and increase an- 
other component, and he may alter their 
balance so as to increase realism or 
autism or to get determination to a 
varying extent by the three components 
considered. Sometimes the percept is 
almost completely exteroceptively pre- 
determined: with well-articulated visual 
stimulus situations, a proprioceptive ex- 
citation pattern can be one which would, 


clude ways of coping with threat which, 
phenotypically, may be revealed by distortions 
apparently aimed at perceiving that which the 
organism fears. 
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alone, result in the perception of the 
vertical, for example, considerably dif- 
ferent from that indicated by vision, 
yet the perceived vertical will, for most 
observers, remain that of the exterocep- 
tive pattern (65). Vary, however, the 
strength of the proprioceptive stimula- 


‘tion relative to that of the exteroceptive 


stimulation (or the extent to which the 
individual habitually relies upon one or 
the other [64] ), and the perceived verti- 
cal apparently may assume compromise 
positions anywhere between those dic- 
tated by the proprioceptive and by the 
exteroceptive excitation (cf. 2, 34, 42). 
Altering the balance of interoceptive and 
exteroceptive c.mponents will probably 
likewise alter the closeness with which 
the response is anchored to the external 
stimulus situation. 

As a first approximation, we may as- 
sume that increased domination of per- 
ception by nonexteroceptive components 
can be achieved by increasing the in- 
tensity of the nonexteroceptive com- 
ponents and decreasing the effectiveness 
of the exteroceptive determinants. The 
latter may be achieved by any method 
which increases the ease with which al- 
ternative figural organizations can be 
fitted to the exteroceptive stimulation. 
Such situations often occur before dif- 
ferentiation is complete (when alterna- 
tive boundaries may be set up), and 
after bounded regions have separated 
but before they have been integrated 
into a stable structure. The probability 
of meeting these conditions can be in- 
creased by (a) the use of any method 
which impairs structuring of the extero- 
ceptive pattern, whether it be by the 
use of short exposure (46), low illumi- 
nations (50), or decreased clarity of the 
stimulus-inhomogeneities upon which 
contours may form (as by the use of 
ground-glass diffusing screens {[33]); or 
(6) the prezentation of complex un- 
familiar patterns (32) or material yield- 
ing unstable (i.e., of more than one 
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equally “good” alternative structure) or 
potentially reversible figure-ground dif- 
ferentiation (53). These are the situa- 
tions most explored in studies of autism, 
perceptual defense, and in projective 
techniques. 

(14) Satisfying aspects of percepts 
tend to become figure in a progressively 
more focused manner. The organism 
is often sensitive to several goals at 
once, its neuromuscular system being 
incompletely focused, but it becomes 
more and more focused or set toward 
those specific goals which have in the 
past proved satisfying (figure-ground 
segregation is not necessarily all-or- 
none but may be a matter of degree). 

In terms of the hypotheses presented, 
with perception a temporally-extended 
developmental process, integration must 
involve the bringing-together of tem- 
porally separated parts (if only through 
the interaction between those parts 
which overlap temporally). The extent 
to which such integration occurs deter- 
mines the extent to which each part is 
modified so as to “lead” to the future 
pattern (as a few notes from a melody 
lead to its completion) as well as con- 
temporary “keyed” parts. 

Where such integration—or structural 
resolution—depends chiefly upon the 
limitations set by the exteroceptive 
stimulus conditions we may, if we wish, 
refer the resultant integration to the 
reinforcement of a drive-to-perceive, as 
does Woodworth (66), or to the achieve- 
ment of a configuration of maximum sta- 
bility, as would the Gestaltists. Where 
integration awaits the resolution of 
interoceptive components aroused by 
tissue needs, the perceptual integration 
of greater stability will usually be the 
one that generates actual behaviors 
which reduce the tissue need (33). 
Upon rearousal of perceptual parts to 
which such interoceptive information 
has contributed, those parts will “lead 
to” the completed perceptual integra- 
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tion and, therefore, indirectly to that 
kind of action which relieves the needs. 
Here, as before, we can refer either to 
the concept of reinforcement or to the 
achievement of stable perceptual con- 
figurations and still be referring to the 
same process; from this viewpoint the 
controversy regarding the Law of Ef- 
fect is purely terminological. 

Thus, the strength of different needs 
with which the individual is faced and 
the history of their satisfaction or their 
frustration become fundamental in the 
determination of any characteristic dif- 
ferences in personal outlook. 

(15) The trend toward integration 
tends to be irreversible. This hypothe- 
sis is certainly both controversial and 
far from capable of full verification. It 
has k-en assumed (p. 333) that pro- 
gressive differentiation tends to occur 
within the major boundaries already 
laid down, and we know (23) that those 
regions of the field having the greatest 
internal articulation tend toward figural 
status; since integration occurs at each 
successive level, reversal, in the sense 
of eliminating boundaries or contours 
already in existence and creating new 
ones, would be difficult. Once “insight” 
was gained, for example, in Boring’s 
ambiguous figure of the old-and-young 
woman, Leeper found it almost impos- 
sible to see the other aspect (32). 
Leeper also found that in material of 
the type of Street’s Gestalt Completion, 
figures once perceived in a given way 
were difficult to get rid of, and appar- 
ently prevented reorganization. Such 
percepts represent convergences of high 
order, so that their destruction is im- 
probable. In order to see any other 
figure, the integration would have to be 
rent asunder and dedifferentiation oc- 
cur, often through many concentric lev- 
els. Thus, the earliest anchorages and 
“associational” integrations precondition 
later perceptual development and tend 
to be permanent or to have permanent 
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effects on further perception and be- 
havior. 

It should be pleaded here that there 
is a good deal of evidence from Freud’s 
studies of cathexis that early trends 
in this direction are never outgrown. 
Childhood loves, whether of mothers or 
of doughnuts, are never wholly lost. In 
fact, psychoanalysis and other therapies 
are here presumed to be necessary in 
part because of early accentuation of 
certain satisfying aspects of the per- 
ceptual field which the organism cannot 
later leave or submerge effectively un- 
der new cathexes. The attempt is made 
elsewhere (40) to show that this prin- 
ciple, called canalization, is supported 
by considerable evidence, and to sug- 
gest that while such canalizations may 
be overlaid or disguised, the moment 
that the overlay is removed, they re- 
bound and reappear in full jorce. 

(16) Those aspects of a percept which 
are only prominently “associated” with 


“satisfaction” tend toward the role of 
figure but are displaced when this con- 


nection ceases. This hypothesis refers 
to the evident daily fact that much 
which tends to attract our notice, and 
which plays a rather large dynamic part 
in our feeling and acting, is neverthe- 
less pushed aside when it is no longer 
related to the satisfaction of our wants. 
The dinner bell rung repeatedly as a 
practical joke becomes sheer clanging 
nuisance. Signals of various sorts which 
key us up and give us joy hold this role 
as long as the anticipated satisfaction is 
actually forthcoming, or as long as we 
maintain an expectancy (having no rea- 
son to believe otherwise); but when the 
relationship is broken, the role is quickly 
lost. ‘This may be what is denoted by 
the extinction of the conditioned re- 
sponse: what was once perceived as “to- 
be-followed-by” satisfaction, now has 
lost those sign-gestalt characteristics; 
what once held the center of the stage 
retreats into relative obscurity. 
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The difference between these last two 
points (hypotheses 15 and 16) dealing 
with “canalization” and “conditioning,” 
respectively, lies in the fact that those 
anchorages which are themselves di- 
rectly “satisfying,” within which further 
stable organization has been achieved, 
cannot be readily discarded; but those 
which owe their connection with satis- 
faction only to the temporary advan- 
tageous position of “signal” or subgoal 
status lose it when the connection is 
broken. In Schafer’s experiment, the 
contours which achieved dominance 
through reward lost this dominance 
when reward was discontinued, since 
such contours were of differential im- 
portance in achieving a stable integra- 
tion only insofar as they were seen as 
leading to reward. Lambert, Solomon, 
and Watson (30) found that the size 
of a poker chip was consistently over- 
estimated (a phenomenon for which no 
clear-cut explanation exists at present) 
while that chip served ss a subgoal in 
the path to obtaining a reward. When 
the reinforcement ceased, the relative 
overestimation of the poker chip also 
ceased, 

It would accord with these assump- 
tions (13-16) if all learning should 
prove reducible to a matter of percep- 
tual restructuring. There is a wide- 
spread tendency at present to emphasize 
the flexibility or malleability of percep- 
tion and to point out that changes in 
behavior follow directly from changes 
in the way in which a situation is 
viewed, so that it is an easy step from 
this point to the proposition that learn- 
ing follows from perceptual reorganiza- 
tion. Education and therapy, as many 
alert observers have suggested, consist 
of fresh perception. 

(17) Perceptual restructuring can oc- 
cur at any level of awareness, so that 
learning while asleep, or learning by in- 
vertebrates, etc., need offer no excep- 
tions. It is suggested that awareness 
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may be regarded as a question of de- 
gree, and, therefore, that while the most 
explicit, sharply differentiated and richly 
structured percepts occur at a level upon 
which introspection can report, there 
may well be—and probably are—in 
adults, in children, and in animals, 
less well-defined perceptual structures 
which, as a consequence of poor differ- 
entiation, defy the precise analysis nec- 
essary for introspective report and con- 
scious manipulation. Indeed, one might 
follow the older suggestions of Leibnitz 
in carrying awareness down one step at 
a time until we conceive very vaguely a 
most primitive and blurred type of ex- 
perience. Functionally we might go 
down below that and say that even at a 
level which we cannot regard as con- 
scious at all, processes of structuring 
and organization occur as if they had 
that quality we know as awareness. 
Moreover, there is considerable evi- 
dence of the existence of sets (perhaps 
directly analogous to what have here 
been dealt with as perceptual matrices), 
or “unconscious determining tenden- 
cies,” of considerable complexity of dif- 
ferentiation (10, 29), whose effects 
(while obtained without conscious par- 
ticipation) are so similar to those of 
conscious discrimination and inspection 
as to suggest the existence of a “pre- 
perceptual observer” in much the same 
fashion as that evidenced by many of 
the “perceptual defense” studies (35, 
36). We see what is coming, so we 
manage not to see it. The organiza- 
tional structure of such percepts, and 
of others suggesting “unconscious per- 
ception” (37), is not as yet separable 
from that of processes in which aware- 
ness is evidently present but the proc- 
esses involved are unmistakably percep- 
tual in nature. 

(18) The more complex phenomena 
of conflict, ego-defense, and psycho- 
analytic dynamics generally may be 
viewed as anchorage and differentiation 
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phenomena in which two or more as- 
pects of a perceptual field can make a 
sharp bid for the role of figure. Here it 
is suggested that instead of being com- 
pletely unconscious of our own strug- 
gling and conflicting tendencies, there 
are phenomena essentially like those of 
perceptual conflict within us at all lev- 
els of awareness. Awareness of our- 
selves, of our own inadequacies, of the 
different directions in which we might 
like to move; awareness of the modes of 
escape and of reconciliation of wishes— 
are all there according to the present 
thesis, in varying degrees of clarity or 
obscurity, dissociation or integration. 

There is more than one way in which, 
from this viewpoint, our behavior may 
be determined by material of which we 
are unaware, and it would prove in- 
structive to examine the behavioral ma- 
terial presently considered homogene- 
ously as “repression,” to determine 
whether or not it can be fitted into such 
diverse categories. To select a few pos- 
sibilities: those aspects of the field 
which are relegated to ground status 
still affect the organization but are not 
noticeable as forms (23). An _ indi- 
vidual might react to several different 
objects or situations in the same fash- 
ion, not because of an unconscious per- 
ception of their similarities—as ex- 
pressed in the concept of substitution— 
but because of an inadequate differ- 
entiation between them, a lack of per- 
ception of their differences (31). Re- 
pression might in some cases be analo- 
gous to the process by which Schafer’s 
subjects rejected perceptua! possibilities 
which led to frustration, or to that by 
which Chapman’s subjects were unable 
to report the attributes not included in 
the Aufgabe (10). In other cases per- 
ceptual repression might be analogous 
to an inability to allocate phenomenal 
causal responsibility to any portion of 
a relatively amorphous, insufficiently 
differentiated field; etc. 
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All that psychoanalysis and the other 
depth psychologies have taught us may 
easily be conceived in terms of the 
learning process, governed by needs and 


their cognitive expression, exactly as in 


the simpler experimental examples, ex- 
cept that continuous conflict and inade- 
quate differentiation contribute to the 
low degree of awareness which some 
components enjoy, and except that per- 
ception of the self in all its rich and 
complex manifestations must receive 
cardinal emphasis. But perception of 
the self could still exhibit the charac- 
teristics of perceptual development, fig- 
ure-ground dynamics, cazalization, and 
the rest of the principles suggested. 


SUMMARY 


Our hypotheses have dealt with the 
interaction of exteroceptive, interocep- 
tive, and proprioceptive components in 
the determination of perception, and 
with the likelihood that both growth and 
learning modify the form of this inter- 


action. 

It has long been emphasized and, 
under the stimulus of feed-back theory, 
it is currently being re-emphasized (63), 
that as we react to our percepts, our 
bodies feed in impressions propriocep- 
tively just as our distance receptors 
feed in, at the same time, reports on the 
changing situation. What has here been 
suggested is that it is unjustified to con- 
sider one of these components alone as 
the basis for perception by the normal, 
functioning organism in its intercourse 
with its environment, and that the 
visceral-affective components may also 
be vitally concerned ir perception; that 
the study of perception includes the af- 
fective and conative phenomena and 
their interrelations with the cognitive 
in such a way that the whole, living in- 
dividual participates in every perceptual 
act; and lastly, a possible relationship 
has been suggested with the learning 
process and with those personality dy- 
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namics which the psychoanalysts and 
other clinicians have stressed. 

These hypotheses are, of course, only 
programmatic and worth but little till 
the hard work of thinking them out, 
testing them, and modifying them has 
been done. It is hoped that personality 
research-methods may in time justify 
the conception that the perceptual dy- 
namics to which these methods point 
are ultimately the same perceptual dy- 
namics which in the laboratory can be 
investigated in terms of figure-ground 
differentiation and other parameters. 
There is only one psychology of percep- 
tion, and it should be equally good in 
the laboratory and in social reality if it 
is solidly constructed. 
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TWO-FACTOR LEARNING THEORY: SUMMARY 
AND COMMENT * 


BY O. H. MOWRER 
University of Illinois 


I 


Whenever one abandons one “faith” 
and embraces another, there is usually 
a compulsion to apologize, or at least 
explain. The explanation, if not apol- 
ogy, in my own case for having de- 
serted the ranks of those who subscribe 
to a monistic conception of learning and 
gone over to a form of dualism goes 
somewhat as follows. 

The law of effect, as developed by 
Thorndike and modified by Hull into a 
strictly monistic theory of reinforce- 
ment, has sometimes been charged with 
being teleological. In its most primi- 


tive form this charge is not hard to 
In keeping with phraseology 


refute. 
laid down by Thorndike, the law has 
commonly been stated as follows: If a 
stimulus situation produces a given re- 
sponse and if that response is shorily 
followed by reward, the response will 
be strengthened. Critics have rightly 
lampooned this careless wording of the 
law, noting that one cannot speak of a 
response being strengthened by events 
that follow it—that is, not without ac- 
cepting teleology. The solution, how- 
ever, 1s obvious: All we have to do is 
to note that it is not a particular “re- 
sponse” that is affected by an ensuing 
reward; rather is it the responding or- 
ganism which is changed, changed neu- 
rologically in such a way that in the fu- 
ture its behavior, under similar circum- 
stances, will be altered. 


* This paper and the five which follow were 
prepared for a symposium which took place 
at the 1950 meetings of the American Psycho- 
logical Association at Pennsylvania State Col- 
lege under the auspices of the Division of 
General Psychology. 


But the spectre of teleology was soon 
to return in a new guise. In their 1940 
book, Hilgard and Marquis (3) called 
attention to a number of experiments, 
particularly one carried out by Brogden, 
Lipman, and Culler (1), in which it 
appeared that animal subjects were ca- 
pable of being rewarded by the mere 
avoidance of noxious stimulation. The 
question which arose here was an awk- 
ward one: How could an animal’s be- 
havior be influenced, not simply by 
subsequent events, but by “subsequent 
events” that do not occur? The pro- 
cedure used in these experiments was 
one that Hilgard and Marquis termed 
“instrumental conditioning.” But the 
use of the term “conditioning” posed a 
paradox: Conditioning has been as- 
sumed to depend upon the association, 
or paired presentation, of so-called con- 
ditioned and unconditioned stimuli; and 
yet in the studies cited it was found 
that better “conditioning” is obtained 
if the CS is followed by the UnCS only 
when a CR fails to occur, instead of in- 
variably. Here it appears that reward, 
as well as conditioning, is in some way 
operating. But to speak of an animal’s 
being rewarded by the non-occurrence 
of something left much to be desired by 
way of explication. 

The resolution of this paradox was, 
in one way, relatively straightforward. 
If a signal has been paired a few times 
with a noxious event, such as an elec- 
tric shock, it will, as everyone knows, 
acquire the capacity to elicit fear. And 
Lamoreaux and I were able to show in 
a series of studies (8, 9, 10) that a re- 
sponse which is instrumental in avert- 
ing an impending noxious event gets re- 
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inforced, not by the non-occurrence of 
that event, but by reduction in the in- 
tensity of the fear which foreshadows it. 

This finding led to a number of other 
studies which have confirmed the hy- 
pothesis that living organisms can be 
rewarded, relieved, reinforced quite as 
effectively by a reduction in the in- 
tensity of fear or any other form of 
secondary motivation as by the lessen- 
ing of a primary drive, such as pain or 
hunger. 

These results seemed, at first, to put 
reward theory on an even sounder basis 
as an all-inclusive explanation of learn- 
ing, since they demonstrated that drive 
reduction is effective in reinforcing be- 
havior regardless of whether we are 
dealing with a primary drive or a sec- 
ondary drive. But on second thought, 
one began to wonder precisely how it 
is that acquired, or secendary, drives 
themselves come into existence. Are 
they, too, dependent upon reward? 
Common sense would suggest that it is 


the temporal coincidence of a signal 
and pain that makes the signal later 


elicit fear. But pain usually abates 
eventually—in the laboratory relatively 
quickly—and it was therefore possible 
to hold that, even here, it was drive 
reduction rather than sheer contiguity 
that somehow provided the reinforcing 
state of affairs. 

This reasoning seemed to lead to a 
testable deduction: If the reinforcement 
for fear learning occurs when the nox- 
ious stimulus which is forecast by a 
signal terminates, and if that signal 
lasts only until the onset of the noxious 
stimulus, then there should be poorer 
fear learning when there is a long in- 
terval between the onset and end of the 
painful stimulus than when this interval 
is of short duration. In other words, 
under the stated conditions, a signal 
which is followed by a brief painful 
stimulus should acquire the capacity tc 


1See Miller’s recent review (4). 
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elicit a stronger fear response than 
should a signal which is followed by a 
painful stimulus of prolonged duration. 
However improbable this inference may 
sound, it follows logically from the 
stated premises, for we know from other 
studies that in conditioning, other things 
equal, the briefer the interval between a 
stimulus and the reinforcing state of af- 
fairs, the greater the learning. 

No one, I believe, has put this par- 
ticular deduction to an experimental 
test; and, indeed, from the standpoint 
of common sense as well as indirect 
scientific evidence, it would be rather 
amazing if the deduction were con- 
firmed. However, about four years ago, 
John Suter and I, using the same general 
logic, designed and carried out a re- 
lated experiment which gave completely 
negative results, i.e., results which were 
negative as far as the reward theory of 
fear-learning is concerned. The results 
pointed to the conclusion that fear 
learning occurs according to a strict 
contiguity principle. 

These results have already been briefly 
reported and will be published in detail 
shortly, so I shall not describe them 
further here. I do not say that they 
are “crucial,” but they served to give 
my confidence in monistic reward theory 
a severe jolt and prompted me to make 
a rather searching examination of the 
possibility that there are two forms of 
learning: (1) sign learning, or condi- 
tioning, on the one hand, and (2) solu- 
tion learning, or problem solving, on 
the other. The results of that examina- 
tion were published in 1947 (5) and a 
study entitled “Further evidence for a 
two-factor theory of learning” is now 
in press (6). On the basis of these 
inquiries I have become a confirmed 
dualist. 


II 


Here I skall give only a brief resumé 
of the two-factor conception of learn- 
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ing which I now believe most defensible. 
This I shall do under four headings: 

(1) Definitions. One of the chief 
obstacles to the emergence of a clear- 
cut, two-process conception of learning 
has been the non-precise way in which 
conditioning and problem-solving have 
often been defined. Certainly a con- 
ception of problem-solving, or trial and 
error, which restricted this form of 
learning to behavior motivated exclu- 
sively by primary drives, was inade- 
quate and incomplete; what we needed 
to recognize is that reward learning can 
and does occur with equal facility when 
the underlying problem or drive is ac- 
quired rather than biologically given. 
But the principal problem arose in con- 
nection with a tendency to over-extend 
the term conditioning and make it in- 
clude many responses which qualify very 
acceptably as problem solving when this 
latter process is properly conceived. In 
order to make a two-factor theory work, 
“conditioning,” it now appears, must be 
restricted to the process whereby emo- 
tions, meanings, attitudes, appetites, 
and cognitions are acquired, with “solu- 
tion learning” applying to all cases of 
overt instrumental habit formation. 

(2) Anatomy. Solution learning seems 
to be largely dependent upon the central 
nervous system and to involve responses 
which are mediated by the skeletal 
muscles, whereas sign learning impor- 
tantly involves the autonomic nervous 
system and is manifested by viscero- 
vascular reactions. The first class of 
responses are, in the main, voluntary, 
and the latter involuntary. 

(3) Description and Measurement. 
Sign learning is often characterized, 
quite correctly, as involving stimulus 
substitution or associative shifting 
(Thorndike), whereas solution learning 
is often described as involving response 
substitution. The one may be properly 
said to lead to responses that are antici- 
patory and the other to responses that 
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are consummatory (Sherrington). In 
terms of the laboratory methods which 
are best adapted to the study of these 
two forms of learning, sign learning is 
usually represented as an incremental 
function (incidence of response) and 
solution learning as a decremental func- 
tion (reduction in time or errors). 

(4) Implications and Supporting Evi- 
dence. In the 1947 paper already cited, 
it was shown that two-factor theory 
satisfactorily resolves a number of con- 
tradictions and paradoxes of long stand- 
ing. In the interim I have found it 
useful in the analysis of language learn- 
ing, perceptual phenomena, identifica- 
tion, and neurotic processes. Collateral 
evidence, which I had previously over- 
looked or which is being obtained 
through new experimentation, continues 
to accumulate and will be presented in 
a publication which will appear shortly 
(6). 

Ill 


In the time that remains I should like 
to refer briefly to a recent paper by W. 
Horsley Gantt (2), entitled “Psycho- 
sexuality in animals,” by way of illus- 
trating the continuing -mergence of evi- 
dence for a two-factor conception of 
learning, sometimes from unexpected 
quarters. The main object of Dr. 
Gantt’s paper is to show, on the basis 
of a wide range of evidence, that sexual 
excitement has somewhat the same ef- 
fect as does sedation or reassurance: 
it brings about a reduction of fear and 
anxiety. Perhaps this is a substantia- 
tion of the familiar assumption that the 
sympathetic and the parasympathetic 
divisions of the autonomic nervous sys- 
tem act antagonistically and that any- 
thing that will powerfully excite the one 
tends to depress activity mediated by 
the other. Here, perhaps, is an exam- 
ple parallel to what is known clinically 
as the “anxious eater,” the person who 
eats, not just to allay hunger, but who 
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uses oral and gustatory stimulation as 
a means of producing parasympathetic 
excitement and quelling, at least for the 
moment, affective states mediated by 
the sympathetic division. 

This jockeying of one division of the 
a. n. s. against the other and its bear- 
ing upon a two-factor conception of 
learning has been considered elsewhere 
(5). Here we are concerned with a 
more or less incidental observation in 
Gantt’s paper, but one that bears cen- 
trally on two-factor theory as formu- 
lated above. Gantt says: 


“Finally, the comparative physiology of 
the cardiac, secretory and motor condi- 
tional reflexes reveals what may be a 
basic constitutional mechanism for nervous 
dysfunction on the higher level of behav- 
ior. Our studies show that the cardiac 
component is formed more quickly than 
the motor component of the conditional 
reflex to pain and that the cardiac com- 
ponent persists much longer. 

“The cardiac component is both more 


sensitive in appearing first and paradoxi- 
cally more stable in persisting longer. With- 
out reinforcement, it has endured as long 
as two years after the motor component has 


dropped out. If the cardiac acceleration 
represents the inner emotional aspect, and 
the motor conditional reflex the specific ex- 
ternal component, we have the picture of 
an organism externally in adaptation to the 
environment but internally excited and re- 
acting to the trace of a conditional stimulus 
which long since ceased to have its former 
significance. The 2nimal has achieved ex- 


ternsi adaptation but ernotionally it is mal- 


adapted. Thus, there is a kind of cleav- 
age between the outer specific muscular 
and the inner cardiac expression” (2, pp. 
49-50). 


It is evident that Gantt is not think- 
ing here in terms of two-factor learning 
theory, but the reported facts are con- 
gruent with such a theory. They show, 
as have other studies, that in an ex- 
perimental situation involving painful 
experiences, the fear (cardiac) reaction 
is acquired first and that the motor re- 
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sponses which then appear are indeed 
caused by the fear and represent efforts 
on the part of the organism to deal 
therewith. With the passage of time or 
the occurrence of extinction, the under- 
lying emotional reaction may gradually 
weaken to the point where it will no 
longer elicit overt, problem-solving be- 
havior but may still be detectable as a 
faint conditioned, é.e., physiological, re- 
action. 

In this context Gantt speaks of an 
interesting phenomenon which I recall 
hearing Walter R. Miles refer to orally. 
Gantt says: 

“Observations have been made by White- 
horn in patients and by Hoffmann ad me 
in dogs that are perhaps related ... . viz. 
the fact that in an individual, at least when 
a spontaneous moverut is made, it is 
preceded rather tha": followed by an ac- 
celeration of heart rate. The nature c‘ the 
spontaneous movements, i.e., whether the 
dog changes from sitting to standing or 
vice versa or merely shifts his weight, is 
immaterial—first there is the cardiac ac- 
celeration, then the movement, or as Hoff- 
mann has put it, ‘first comes the heart rate 
and then the dog.’ From these observa- 
tions of normal behavior, it appears that 
some natural excitation in the central nerv- 
ous system, expressed by an increase in 
heart rate, not only results in but rapidly 
expands itself in muscular movements” (2, 
p. 48). 


These two sets of observations by 
Gantt graphically emphasize the double 
function of visceral mobilization of the 
type which is represented by increased 
heart rate. Gantt’s latter remarks in 
this connection show that visceral mo- 
bilization may occur simply by way of 
physiological preparation for action on 
the part of the skeletal musculature, ac- 
tion that may be motivated by hunger, 
postural tension, or the like. But be- 
fore an organism, at least before a 
“higher” organism, makes a gross, overt 
response under these conditions there is, 
presumably, an “intention” to act, con- 
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sisting of an implicit mediating (sym- 
bolic) reaction of some kind; and this, 
having commonly preceded gross muscu- 
lar activity (which depletes physical en- 
ergy and thus reflexly elicits physio- 
logical activity), has therefore become 
a “conditioned stimulus” for physiologi- 
cal activity. Here we appear to be 
dealing with autonomic-visceral reac- 
tions which are proportional to the 
physiological needs of the organism and 
involve no element of over-preparation, 
no excessive internal pressure or ten- 
sion, no motivation. But when au- 
tonomic-visceral reactions occur which 
exceed such needs, they take on moti- 
vational properties, and we call them, 
or at least the sensations they produce, 
emotions. 

We see, therefore, that conditioned 
reactions, in the strict sense of that 
term, may serve one or both of two 
purposes: They may provide 2 physio- 
logical background, or substructure, for 


_ energetic, overt behavior; and they may 


also serve to motivate, or cause, behav- 
ior. A conditioned reaction may there- 
fore be anticipatory in a double sense: 
it may anticipate some external event, 
such as pain or injury, and in so doing 
provide motivation for behavior which 
serves to evert the event; but, at the 
same time, the conditioned reaction also 
anticipates the physical needs that will 
be created by such action—or by ac- 
tion which is created, as in the second 
type of example given by Gantt, by 
other needs. 

Conditioned responses, as we thus 
see, are intimately related to physiology, 
as distinct from problem-solving behav- 
ior, which is mediated by the central 
nervous system and the skeletal muscu- 


lature. It is perhaps not unreasonable 
to suppose that such very different 
types of activities on the part of living 
organisms should exemplify two distinct 
learning processes. 
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The question of whether a multiplicity 
of processes must be postulated in order 
to account satisfactorily for the phe- 
nomena of animal learning poses two 
independent but related problems. The 
first is a phylogenetic problem for which 
the facts of comparative anatomy sug- 
gest a clear answer. Since the behavior 
of the different animal species is modi- 
fied through the operation of different 
organic systems, the question, from the 
phylogenetic viewpoint, is no longer 
whether a multiplicity of processes exists 
but whet are the characteristics of the 
diverse processes that make learning 
possible at the various levels of ani- 
mal life. The second problem is intra- 
specific rather than inter-specific. It is 
concerned with the nature of the proc- 
esses which underlie learning in any 
given animal form. This paper, then, 
is divided into two sections. In the 
first we shall deal briefly with some of 
the implications of comparative psy- 
chology for learning theory, while in 
the second we shall be concerned with 
the intra-specific problem. We have 
chosen to consider cognitive theory in 
our discussion because we {eel that it 
provides an excellent vehicle for the 
analysis of both aspects of the multiple- 
process problem. — 


There can be little doubt that the: 


most influential criticism of a pure re- 
inforcement theory of learning has been 
provided by the cognitive theorists un- 
der the leadership of Professor Tolman 
(15). The effectiveness of this criti- 
cism is revealed both by the amount of 





experimental work which reinforcement 
theorists have devoted to attacking the 
weaker aspects of Tolman’s position 
and by the silence they have maintained 
concerning some of the most crucial 
items of evidence adduced by the cogni- 
tive theorists. However, although we 
are indebted to Professor Tolman for 
repeatedly calling the attention of psy- 
chologists to phenomena of learning 
which stimulus-response psychologists 
have tended to overlook, we believe 
that his systematic point of view shares 
a vital weakness with reinforcement 
theory. Both positions are anti-com- 
parative, functionally if not explicitly, 
in that they lead to formulations which 
are so gross as to be applicable to all 
animal forms—not only to rat, the fa- 
vorite subject for investigation, but to 
paramecium and to man. True, Pro- 
fessor Tolman’s occasional reference to 
“capacity laws” may be regarded as a 
bow in the direction of phylogenesis, 
yet it is difficult to see why the edu- 
cated paramecium of Day and Bentley 
(4) could not be characterized, in Tol- 
man’s terms, as a conscious, purposeful 
creature which develops cognitive maps 
(however narrow) and acquires sign- 
gestalt-expectations. In Professor Hull’s 
terms, of course, changes in the behav- 
ior of the trained paramecium might be 
thought of only as another instance of 
the strengthening of stimulus-response 
connections under the influence of need- 
reduction, 

The adoption by both theorists of 
what has become known as the molar 
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viewpoint leads inevitably to the blur- 
ring of phylogenetic distinctions. It 
is sometimes useful in the preliminary 
stages of research to think of behavior 
in functional terms which make no im- 
mediate reference to underlying bodily 
processes, but it is inexcusable to cling 
doggedly to gross formulations which 
are based upon the end results of be- 
havior and which are permanently di- 
vorced from underlying mechanisms. 
The processes of modification in para- 
mecium and rat are not qualitatively 
identical because the two animal forms 
are anatomically quite dissimilar. Any 
theoretical approach which glibly ob- 
scures this difference should be recog- 
nized as the most dangerous variety of 
subjectivism. The so-called molar view 
is sometimes justified in the name of 
parsimony, but this clearly is to mis- 
understand the meaning of the con- 
cept. In the case of the paramecium— 
to return to our rather extreme exam- 
ple—the superficiality of both molar 
formulations is demonstrated in an ex- 
periment suggested by Buytendijk (3). 
Preliminary immersion of a naive con- 
trol animal in a solvent renders it just 
as capable of escaping from the capil- 
lary tube as an experimental animal 
patiently practiced after the manner of 
Day and Bentley. This finding makes 
it clear that improvement in the trained 
animal may be the product of increased 
flexibility resulting from activity-pro- 
duced chemical changes in protoplasmic 
constituents, and may have no relation 
to the specific stimulating conditions 
encountered in training, or to the na- 
ture of the ends attained by the re- 
sponses which terminate each training 
trial. Theoretical frameworks of the 
molar sort not only provide no pegs on 
which to hang such findings, but tend 
to discourage the fundamental research 
which produces them. Here is parsi- 
mony by exclusion—by exorcism. 

This common failing of cognitive and 
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reinforcement theory must not be ob- 
scured by the fact that the two formula- 
tions are couched in very different lan- 
guage. Professor Tolman likes to talk 
in phenomenal terms frankly borrowed 
from the introspectionists, while Profes- 
sor Hull prefers a dialect of the Wat- 
sonian language. Certainly Professor 
Tolman’s formulations are dangerously 
anthropomorphic, but in a fundamental 
sense so also are those of Professor Hull. 
Anthropomorphism is dangerous not be- 
cause it humanizes animals, but because 
it encourages us to be content with con- 
ceptions which are nothing more than 
loose analogies and which therefore ob- 
scure real differences among phyloge- 
netic levels. Hullian theory falls into 
the same error. Only the hind of 
analogy upon which it is based is dif- 
ferent. 

A psychology without mind may be 
conceivable, but a psychology without 
mind or body is not. If we take the 
facts of evolution seriously, then the 
answer to the first question before us is 
clear. There is not one learning proc- 
ess, but many. We should not be at all 
surprised to find qualitatively distinct 
processes at every major level of ana- 
tomical organization. Certainly we may 
expect to find general processes which 
are essentially similar over a wide range 
of organisms in the phylogenetic series, 
but we cannot be content until we have 
found differences in functional capacity 
consonant with the wide differences in 
structural organization which confront 
us. 

The history of learning theory shows 
an ever increasing gap between the phe- 
nomena which are presumably ‘under 
consideration and the theoretical sys- 
tems that have been constructed to deal 
with them. It is only living organisms 
that learn, and any systematic discus- 
sion of learning must, if it is to remain 
relevant, refer to the characteristics of 
these organisms in all their similarity 
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and diversity. Behavior which charac- 
terizes the rat may be quite atypical of 
the earthworm, and even the briefest 
glance at man demonstrates his differ- 
ence from the stereotyped ant or the 
lowly amoeba. Operational ease can- 
not be substituted for the hard digging 
of science, and the ubiquitous white rat 
is an inadequate model upon which to 
base our conceptions of the psychologi- 
cal natures of all other organisms. . In 
the concentrated and often heated ac- 
tivity of system making, the simple 
truth that organisms exist at different 
phylogenetic levels has been lost, and 
with it the essential problem for learn- 
ing theory—the search for the mecha- 
nisms which make behavioral modifica- 
tion possible. 

When we turn from the broad phylo- 
genetic problem to an inquiry intc the 
nature of modification ii any given ani- 
mal form, such as the rat, similar con- 
siderations are relevant. Here we no 
longer have so strong a reason to antici- 


pate the discovery of a multiplicity of 
qualitatively distinct processes rather 
than a single basic process, but once 
again it may be well to proceed cau- 


tiously. Professor Tolman has sug- 
gested that “our familiar theoretical 
disputes about learning may perhaps 
. . » be resolved, if we can agree that 
there are really a number of different 
kinds of learning. For then it may 
turn out that the theory and laws ap- 
propriate to one kind may well be dif- 
ferent from those appropriate to other 
kinds. Each of the learning theories 
now current may, in short, still have 
validity for some one or more varieties 
of learning, if not for all” (16, p. 144). 
He then goes on to recognize the va- 
lidity of reinforcement theory with re- 
spect to the acquisition of “positive 
cathexes” (although he hastens to add 
that there is no good evidence for the 
view) and, in the matter of “motor pat- 
terns,” allows Guthrie to win by de- 
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fault. In our opinion this genial eclec- 
ticism, although in many respects ad- 
mirable, misses the important point. 
While it is true that each of the major 
theories has tended to focus on limited 
aspects of learning, we can by no means 
be sure that any given theory deals ade- 
quately even with those phenomena with 
which it has been most directly con- 
cerned. The data obtained in experi- 
ments conducted by adherents of the 
various schools must be given respect- 
ful consideration, but it does sot fol- 
low that the explanations advanced to 
account for these data can be accorded 
equivalent status and set side-by-side to 
make a comprehensive theory. Lump- 
ing together the reports of the seven 
blind men does not produce an ade- 
quate description of the elephant. 

In a recent paper we have defended 
the position that at least two qualita- 
tively distinct learning processes must 
be postulated in order to deal ade- 
quately with available data on the 
learning of lower mammals (1). On 
the one hand there is the organization 
and reorganization of motor patterns 
which occurs within the confines of a 
given environment. The animal selects, 
refines, and fixates those responses which 
are appropriate to the relationship which 
exists between internal and external con- 
ditions. This is the problem with which 
reinforcement theorists have been al- 
most exclusively concerned, and the evi- 
dence to some extent justifies their theo- 
retical emphasis on afferent-efferent in- 
tegrations and the role of biological 
satisfactions. But we are stil! very 
much in the dark with respect to the 
nature of the process or processes which 
make selective learning possible. Hull’s 
fourth postulate (6) does not begin to 
deal with this problem, but in fact 
avoids it. Not many years ago learn- 
ing theorists were apt to pose a funda- 
mental question relating to the em- 
pirical law of effect. In what way, 
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they asked, do the satisfying conse- 
quences of a response work back upon 
the organism to increase the likelihood 
that the response will appear on subse- 
quent contacts with the situation? Un- 
fortunately, in this heyday of modern 
positivism, such questions have become 
exceedingly unpopular. 

The data of conditioning experiments, 
on the other hand, suggest the operation 
of a second process which seems much 
easier to understand. Following Maier 
and Schneirla (8) we have chosen to 
refer to it as a process of sensory inte- 
gration. When two afferent centers are 
contiguously activated, a functional re- 
lation is established between them such 
that the subsequent innervation of one 
will arouse the other. Here we postu- 
late a purely afferent process of modifi- 
cation which may operate not only in 
the absence of concurrent motor ac- 
tivity but in the absence of need-reduc- 
tion as well. This process is revealed 
most directly in the development of 
stimulus-equivalence which occurs in 
conditioning—when two stimuli are pre- 
sented contiguously, the first acquires 
some of the functional properties of the 
second. It also helps us to understand 
many of the phenomena which occur 
in other learning situations, especially 
those phenomena which, as the cogni- 
tive theorists have noted, cannot be 
dealt with adequately by Hullian 
theory. It should be recognized, of 
course, that the concept of sensory inte- 
gration has long been implicit in the 
cognitive formulations. Some years ago 
Professor Spence (12) made the point 
clearly when he labeled Professor Tol- 
man’s position an “S-S Contiguity” 
theory. 

In criticism of stimulus-response-re- 
inforcement theory, the cognitive stud- 
ies reiterate one essential point, namely, 
that animals may respond appropriately 
under changed motivational or environ- 
mental conditions ever: iicugh the re- 


H. G. Brrce anp M. E. BitTeRMAN 


sponses elicited are not identical with 
those previously practiced. This means 
that in the course of learning integra- 
tions are developed which cannot easily 
be understood as the attachment of re- 
sponses to stimuli. However, cognitive 
theorists have not been concerned with 
the mechanisms underlying such behav- 
ior, but have contented themselves with 
phenomenal analogies to a presump- 
tively understood human consciousness. 
The important studies of Blodgett 
(2) and Tolman and Honzik (18), 
which clearly demonstrate that the rat 
learns a great deal about a maze under 
conditions which precluce the reinforce- 
ment of specific response patterns, have 
provided the strongest support for the 
cognitive position. That the data of 
these experiments pose an extremely 
difficult problem for stimulus-response- 
reinforcement theory is attested by 
Hull’s failure even to mention them in 
his Principles of Behavior, and by the 
scant attention which they receive from 
S-R theorists in papers on the more re- 
cent, though directly related, researches 
with the T-maze. In our opinion the 
principle of sensory integration makes 
it possible to understand these results. 
It might be profitable to ask once 
more the famil? r question of how a rat 
learns to take the true-path turn, rather 
than enter the cul, at a choice-point in 
a maze. The traditional answer of re- 
sponse-reinforcement theory is about as 
follows: The complex cues (interocep- 
tive as well as exteroceptive) which are 
present at the choice-point become in- 
dividually, and as a group, connected to 
the response of turning in the correct 
direction on the basis of ensuing rein- 
forcement, primary or secondary. But 
the concept of sensory integration sug- 
gests an alternative interpretation. From 
this point of view, not one, but two 
stimulus configurations may function at 
the choice-point, at least in the early 
stages of learning—one arising from the 
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mouth of the cul and the other from 
the entrance to the true path. When 
the animal takes either path a succes- 
sion of new afferent patterns arise, in 
the course of movement, with which the 
entrance-pattern (directly or mediately) 
becomes functionally equivalent. For 
example, when the animal enters a cul 
the entrance-percepi is followed sooner 
or later by a characteristic obstruction. 
If the blind is both short and homoge- 
neous, the closely contiguous relation be- 
tween entrance- and end-patterns gives 
rise to a process of afferent integration 
and the equivalence is readily estab- 
lished. When motivational conditions 
are such that avoidance-behavior would 
be evoked by the cul-end, this behavior 
will now be evoked by the cul-entrance. 
In longer and less homogeneous blinds, 
such as those which involve a number 
of turns, there is a succession of critical 
percepts (principally at the elbows of 
the cul), and end- and entrance-per- 
cepts are less immediately related. Un- 
der these conditions cul-elimination pro- 
ceeds in stepwise fashion, backwardly 
from end to entrance, with the units of 
blind-alley pathway usually being elimi- 
nated as wholes. From our point oi 
view, the less rapid elimination of long 
blinds is explained, not in terms of a 
temporal gradient of reinforcement, but 
in terms of the temporal relation be- 
tween stimulus patterns arising at cul- 
entrance and cul-end. The great diffi- 
culty which is presented by circular 
blinds is to be understood from the fact 
that they provide no characteristic bar- 
rier-percept. The principle of sensory 
integration also leads to the assumption 
that choice-point learning in the early 
sections of a complex maze is to be un- 
derstood, at least in the initial stages 
of learning, not in terms of the de- 
velopment of true-path preferences, but 
rather as a process of cul-avoidance, 
since the afferent consequences of cor- 
rect turns are less structured and less 
directly significant than those of cul- 
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entry. The true path units do, of 
course, eventually acquire more positive 
properties, but in a more gradual fash- 
ion due to their remote relation to the 
goal-object, and the very rapid learn- 
ing which occurs at later points in the 
maze can be interpreted as a joint func- 
tion of integrations which develop with 
respect both to cul- and true-path en- 
trances. It might also be suggested that 
“latent” learning in a complex maze is 
largely confined to the region of the 
culs, as an experiment by Reynolds 
(10) has indicated, and to the goal-box 
which has a rather special afferent char- 
acter derived from the removal of the 
animal from the maze. 

In order to deal with serial learning 
in terms of reinforcement theory, Hull 
is forced to rely heavily on the prin- 
ciple of secondary reinforcement which 
bridges the temporal gap between early 
responses in the series and the terminal 
“primary” need-reduction. According to 
this principle, which is regarded as a 
corollary of the fourth postulate, stimuli 
which arise in the course of running not 
only become guides to action, in the 
sense that they are connected to ap- 
propriate responses, but they—the same 
stimuli—also acquire goal properties 
which are mediated in a backward di- 
rection. In some respects this hypo- 
thetical process resembles our own con- 
ception of the manner in which true- 
path units acquire positive properties, 
but there are important differences. 
Hull maintains that the development of 
secondary reinforcement is itself a proc- 
ess of stimulus-response conection, in 
which the stimuli in question become 
attached to “fractions” of the response 
patterns involved in need reduction. 
Afferent relations are not, in this view, 
established directly, but must always 
be response-mediated, and these rela- 
tionships can only be established when 
one of the stimuli itself arouses need- 


reducing responses. 
Now it is our position that all of the 
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data with which Hull deals in terms of 
the concept of secondary reinforcement 
can be dealt with equally well if one as- 
sumes a process of sensory integration 
in which relations between afferent proc- 
esses, or sensory equivalences, are estab- 
lished, not indirectly, but directly, in 
the course of contiguous arousal, and 
which may occur in the absence of 
drive-decrement. Furthermore, this as- 
sumption makes it possible to account 
for the classical latent learning data, 
while Hull’s conception does not. From 
our point of view the latent learning ex- 
periment may be understood as a com- 
plication of the sensory pre-conditioning 
experiment. It must be emphasized 
that not only a contiguity principle, but 
an afferent contiguity principle, is re- 
quired by this evidence. Neither the 
version of Professor Guthrie (5) nor 
the version of Professor Mowrer (9) 
will do. 

In recent years cognitive theory has 
stimulated a variety of experiments with 
the T-maze, which are closely related to 
the latent learning problem, and which 
point even more clearly to a sensory 
integration process. For the most part, 
these experiments may be grouped un- 
der three major headings. In all three 
the rat is trained in the maze, food 
being situated in one arm and water in 
the other, and later tested under moti- 
vation which corresponds to one or the 
other of these incentives. In the first 
type of experiment the animal is sati- 
ated for both food and water during 
training and in the second type of ex- 
periment.the animal is both hungry and 
thirsty during training. The results of 
both types of experiment show a signifi- 
cant tendency for the rats to respond ap- 
propriately when singly motivated—hun- 
gry or thirsty. Even if reinforcement 
operates in both experiments—second- 
ary reinforcement may be presumed to 
operate in the first—the forced-turn 
procedure employed in training should 
keep the alternative response-tendencies 
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approximately equal in strength, and 
the animal should respond randomly. 
Even if turning tendencies of unequal 
strength become established—due per- 
haps to the greater strength of one or 
the other of the drives or to the greater 
potency of one or the other of the re- 
inforcements—the animal cannot be ex- 
pected to respond appropriately under 
both conditions of single motivation in 
the test trials. The ad hoc principle of 
the selective association of drive-stimuli 
(7) may account for the results of the 
second type of experiment, but it will 
not account for those of the first type 
in which there are no drive stimuli to 
be selectively associated. The last re- 
sort for the reinforcement theorists is 
the untestable concept of unobservable 
anticipatory response (13, 17). 

In a third major type of experiment 
stimulated by cognitive theory, the ani- 
mal is trained under one drive and 
tested under the second. Here the ani- 
mal fails to respond appropriately, and 
the reinforcement theorists, as might be 
expected, have made much of this fact. 
A recent experiment by Walker, Knot- 
ter, and DeValois (19) suggests, how- 
ever, that the forced-turn procedure 
ordinarily employed may in great meas- 
ure be responsible for the negative re- 
sults. Trained under conditions of wa- 
ter “eprivation, with water in one goal 
box and both food and water in the 
other, their animals did respond appro- 
priately when the drive was shifted to 
hunger. In another variation of the 
basic design Strange (14) found clear 
evidence of latent learning under single 
drive conditions. 

In conclusion, there are two points 
that we should like to make. First, 
there is a good deal of other evidence 
which, in our opinion, can be better un- 
derstood in terms of the concept of sen- 
sory integration than in terms of either 
cognitive or reinforcement theory. Our 
purpose here was only to show how the 
idea of sensory integration which was 
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developed originally to deal with the re- 
sults of conditioning studies could be 
applied to behavior in other standard 
learning situations. Secondly, we should 
like to note that the sensory integration 
postulate is regarded, not as a final 
solution to our problems, but as a basis 
for further research. What conditions 
other than contiguity of. stimulation af- 
fect the integration process, and in what 
manner? Do the afferent consequences 
of motivational states play a special 
role? It seems that they do, although 
it is likely that the onset and presence 
of these afferents are just as significant 
as their decline and disappearance (11). 
Above all, there is the question of how 
an understanding of sensory integration 
will make it possible for us to deal 
with the problem of selective learning. 
Traditional stimulus-response theory is 
based on the assumption that any 
stimulus may be tied to any response. 
But suppose we say instead that, not 
stimuli, but afferent organizations or 
afferent integrations produce character- 
istic patterns of response, and that for 
each pattern of afferent organization 
there can be one, and only one, pattern 
of response. It follows directly frora 
this formulation that the process of se- 
lective learning can be considered, not 
as a process of stimulus-response con- 
nection, but as @ process of achieving 
the afferent organization required to 
produce a given pattern of response. 
Here at last we may have a meaningful, 
process-oriented translation of the dic- 
tum of the cognitive theorists: Given 
knowledge, behavior will take care of 
itself. 
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THE CONTIGUITY PRINCIPLE IN LEARNING THEORY 


BY FRED D. SHEFFIELD 
Yale University 


The last twenty years of basic learn- 
ing theory has reflected a period of the 
supremacy of the law of effect and the 
discounting of contiguity. This trend 
started when conditioned response peo- 
ple began noticing that conditioning was 
often complicated by selective learning 
which did not seem to follow the Pav- 
lovian rubric. We can perhaps identify 
the starting point of this era with the 
1928 article by Miller and Konorski 
(12) on a special type of conditional 
reflex—later to be named the condi- 
tioned or discriminated “operant” (19), 
the “instrumental” conditioned response 
(7), and so forth. The development 
subsequent to Miller and Konorski has 
been one of rediscovering the law of 
effect—and relegating contiguity to a 
minor role either as a primitive learning 
principle among dualists or as a neces- 
sary but insufficient factor among the 
single-process effect theorists. 

Now, I suspect, we are on the thresh- 
hold of a rediscovery of contiguity. I 
think Mowrer’s paper announcing his 
switch to dualism in the 1947 Harvard 
Educational Review (13) is one of the 
signs of the times. Also the 1949 Birch 
and Bitterman (1) critique of the ef- 
fect position—and support of Maier 
and Schneirla (10) on dualism—I in- 
terpret as an undercurrent of dissatis- 
faction with the recent pre-eminence of 
effect as the major learning principle. 
Mowrer has elevated contiguity to an 
equal status with effect as a sometimes- 
sufficient condition for learning. Birch 
and Bitterman have hinted that their 
sensory integration version of contiguity 
will handle phenomena which even 
previous dualists have corsidered the 
exclus:ve property of effect. In each 


case these authors suggest a trend to- 
ward reinstating the significance of 
contiguity as a sufficient condition for 
some learning. 

At the same time there seems to be 
another sort of trend underway in fa- 
vor of contiguity. This second trend I 
would say is more relevant to contiguity 
as a factor than contiguity as sufficient. 
It consists of the discovery that conti- 
guity is a powerful source from which 
to derive learning phenomena without 
regard to one’s position as to whether 
contiguity is sufficient—or only neces- 
sary. A good illustration is seen in 
Estes’ (3) recent article in which he 
disowns the reinforcement problem and 
proceeds to deduce a mathematical 
model of selective acquisition based on 
a very Guthrian (6) set of assumptions 
flowing from contiguity alone. Other 
illustrations are seen in modern inter- 
pretations of partial reinforcement, as 
presented in the Jenkins and Stanley 
(9) review article, and in the interpreta- 
tions of extinction and spontaneous re- 
covery in Skinner’s (20) very recent 
paper challenging theories in general 
and S-R reinforcement theories in par- 
ticular. 

Perhaps my perception of the current 
situation is biased, but I feel that we 
are getting further and further in han- 
dling the major phenomena of learning 
with less and less worry about reward 
as a factor. Perhaps we can carry it all 
the way and eliminate the law of effect 
from a general treatment of learning. 
I do not feel entirely optimistic about 
this parsimonious outcome but I think 
it is worth a concerted effort. And at 
the moment I would place the dualist 
position as a halfway point in such an 
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effort. It strikes me as an easy out— 
with defeatist consequences—for a cur- 
rent situation in which neither conti- 
guity nor effect can make a convincing 
universal case. 

On a less intuitive basis my 0” ‘ec- 
tions to dualism are easily classified: 
(1) there appears to be strong evi- 
dence that learning—when properly de- 
fined—is a single process, and (2) the 
arguments of the dualists are uncon- 
vincing whether one takes a single- 
principle cuniiguity approach or whether 
one plays devil’s advocate and takes a 
single-principle effect approach. 

Starting with the first of these—the 
evidence for a single process—my point 
is simply that the phenomena exhibited 
in selective learning are so similar to 
those of Pavlovian conditioning that it 
is hard to believe in any differences in 
physiological process. One should of 
course be prepared to expéct that two 
different physiological mechanisms will 
have the same overall outcome—as when 
temperature regulation is achieved both 
by perspiration and by vasodilation. 
But one should not be prepared to ex- 
pect that two different mechanisms will 
exhibit point-by-point similarity in their 
detailed properties. Yet this is the 
case in a conditioned modification and 
a selectively acquired one. The ease 
with which the S-R reinforcement peo- 
ple have been able to appropriate Pav- 
lov’s findings attests to the similarity 
of the details of the two forms of learn- 
ing. In such matters as acquisition, ex- 
tinction, spontaneous recovery, generali- 
zation, differentiation, effects of partial 
reinforcement, and so forth, it makes 
no difference whether we are talking 
about a Thorndikian or a Pavlovian 
learning situation. The impressive fact 
is that we can translate from one disci- 
pline to the other by the simple ex- 
pedient of interchanging appropriately 
the terms “reward” and “unconditioned 
stimulus.” To me this makes it very 
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improbable on @ priori grounds that 
there is not a common process. The 
notable differences are not in properties 
but rather in inferred experimental op- 
erations in reinforcing with an uncondi- 
tioned stimulus on the one hand and 
with a reward on the other. This sug- 
gests looking extensively for a hidden 
similarity in operations or process—it 
does not readily suggest that there is a 
different process to go with each ex- 
perimental operation. 

In my second objection to dualists— 
that their arguments are not convinc- 
ing—I shall not attempt to cover all 
dualists. Instead I will mention only 
present company. I am not impressed 
by the Birch and Bitterman argument 
because it hinges primarily on what I 
consider to be an incorrect analysis of 
avoidance training. Here I have to 
play devil’s advocate and defend. the 
law of effect. Their main objection to 
a single-principle effect theory is that 
they believe the findings of avoidance 
learning disprove the Hullian hypothe- 
sis that cessation of pain is a reward. 
Their disproof is too easy to be true. 
The argument runs as follows: in avoid- 
ance training pain and its termination is 
invariably correlated with failure to re- 
spond; mo pain is invariably correlated 
with the correct response. Therefore, 
they argue, according to Hull failure to 
respond is rewarded by pain termina- 
tion whereas performance of the cor- 
rect response is subjected to extinction 
through non-reward. Since the facts of 
avoidance learning are just the opposite, 


- Hull’s notion of the reward value of 


pain termination is said to be disproven. 

However, Birch and Bitterman have 
ignored entirely the factor of condi- 
tioned fear as a drive—and reduction 
of fear as a reward. These factors 
play a critical role in modern effect in- 
terpretations of avoidance learning as 
formulated by Mowrer (13) and others 
(14, 16). In this interpretation the 
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critical fact is that failure to make the 
correct response becomes very frighten- 
ing because it invariably produces pain. 
At the same time performance of the 
correct response cannot become fright- 
ening because it is invariably not cor- 
related with pain. Thus in avoidance 
training we create an instrumental learn- 
ing situation—with fear as drive—and 
in which the only way to convert a 
frightening situation into a nonfrighten- 
ing one is by lifting the paw, leaping 
into the air, or whatever else the experi- 
menter might require. This can be 
handled by Hull (8) in drive-reduction 
terms or by Guthrie (4) in stimulus- 
change terms. 

I am more impressed with the Birch 
and Bitterman argument that sensory 
pre-conditioning is an exception to the 
law of effect, although I am also fa- 
miliar enough with effect dialectics to 
know that there are several ways out. 
For one we can postulate an investi- 
gatory drive which is satisfied by at- 
tending to any and all environmental 
changes. For another we can note that 
with strong stimuli the mere cessation 
of the stimulation is reinforcing to what- 
ever response the animal makes to the 
stimuli. Thus we may predict that the 
response to S, becomes connected to S. 
and vice versa—and will mediate just 
about the amount of transfer actually 
obtained—namely, a small amount in 
most cases. Moreover, the usual small- 
ness of transfer in sensory precondi- 
tioning appears to be inconsistent with 
their sensory-integration interpretation 
of Pavlovian conditioning. Thus the 
Birch and Bitterman argument can be 
said to backfire considerably if the 
question is asked as to why the phe- 
nomenon is always such a weak counter- 
part of direct conditioning. If all con- 
ditioning by contiguity is afferent modi- 
fication, why did Brogden (2), in the 
original experiment on this topic, find 
that some subjects failed to show any 
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transfer, the average transfer was only 
21 per cent of direct conditioning, and 
what little transfer was obtained initi- 
ally extinguished very rapidly? 

The Mowrer and Suter (15) experi- 
ment will also be attacked by single- 
process effect theorists. For example, 
does a signal that precedes shock, ac- 
companies shock, and terminates with 
shock become a cue for fear or a mixed 
cue for fear and the relaxation of fear 
that accompanies the termination of 
shock? Or, since the animal is free to 
move about on a grill, what evidence is 
there that all animals do not momen- 
tarily stop shock immediately by par- 
tial hops from the grill? Or, to be 
technical, what control is offered for 
an Adrian-type “on effect”—with an 
immediate and sharp pain reduction 
through sensory adaptation—as sug- 
gested by Miller and Dollard (11) 
when a cue is correlated with onset of 
pain? 

Personally I think the Mowrer and 
Suter experiment is embarrassing for 
the effect position but not a crucial ex- 
ception. On the other hand, it goes al- 
most without saying that the experi- 
ment would not convince a contiguity 
theorist that one is forced into dual- 
ism; rather it is merely further support 
that contiguity is sufficient. Similarly 
Mowrer’s (13) “collateral support” for 
dualism, while it helps make the dual- 
istic position reasonable, does not at all 
show one is forced into dualism. The 
hypothesis that contiguity applies only 
to visceral-autonomic-involuntary effec- 
tors whereas effect applies only to 
skeletal-central-voluntary effectors is in- 
triguing but unconvincing. Skeletal re- 
sponses can be conditioned and pri- 
marily visceral responses can be selec- 
tively learned. At least this is the 
way I interpret eyelid conditioning and 
the use of rewards and punishments in 
teaching control of bowel and bladder 
sphincters. 
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Moreover, the voluntary-involuntary 
breakdown does not seem to be par- 
ticularly neuro-anatomical. It is well 
known that many visceral responses can 
become voluntary, and skeletal responses 
which are not ordinarily useful never 
come under voluntary control. Few 
people can wiggle their ears, cross their 
eyes at will, or move each toe inde- 
pendently, although all of these can be- 
come voluntary with practice. A rea- 
sonable alternative to the Mowrer hy- 
pothesis is that while skeletal responses 
are continually determining whether re- 
wards are forthcoming, visceral systems 
rarely get exposed to selective learning. 
Instead, visceral systems are reared in 
the constant and predictable type of en- 
vironment which promotes the evolu- 
tion of innate reflexes and prevents se- 
lective learning from getting a chance 
to occur. However, selective learning 


apparently can take over in the au- 
tonomic when, for example, a conflict 
makes acquisition of a psychosomatic 


symptom rewarding. 

These foregoing single-process argu- 
ments can of course be as well utilized 
by an effect theorist, and do not par- 
ticularly support contiguity except in a 
roundabout way. I would now like to 
drop the role of devil’s advocate and 
argue that there is so much room for 
theoretical expansion within the frame- 
work of any theory of selective learning 
that a contiguity approach is at least 
as promising as the effect one. 

The room for expansion comes chiefly 
from the fact that nobody knows what 
“effect” is. It is an odd law that con- 
tains a term so indeterminate in ad- 
vance. It is analogous to the law of 
sleep which states that “sleet: is pro- 
duced by soporific agents.” As cur- 
rently used, effect is defined as any- 
thing which strengthens instrumental 
responses. We can collect an empirical 
list of such events and get a workable 
statement of the law of effect as fol- 
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lows: “If a response occurs contiguously 
with a neutral stimulus pattern, it will 
become connected to that pattern if it is 
followed by one of the things in the 
list.” However, this workable law gives 
only a circular description of the opera- 
tions producing “effect.” We are es- 
sentially defining effect as any event 
which makes the law of effect true. 
Descriptively this procedure may be 
practical but it leaves us up in the air 
about the outcome with an untried 
event. It also makes the law of effect 
exceptionless and untestable from a re- 
ductionist standpoint. 

The reductive alternative to this hand- 
book type of “law” is to seek a com- 
mon factor in the empirical list of re- 
wards. If we hit upon the right factor, 
we can predict the outcome when a new 
untried event is put in the reward po- 
sition in a trial and error situation. 
The search for such a factor has been 
hampered because of one fairly common 
factor in the items of the empirical list 
collected so far. It is the property of 
being events which we feel are desirable 
from tle standpoint of the organism. 
This leads to one of the most common 
and reprehensible versions of the law of 
effect—a version which merely says that 
responses with “adaptive” outcomes get 
fixated, and does not state the procedure 
for determining the adaptivity of a 
response. Such formulations probably 
constitute the most common way of con- 
ceptualizing the law of effect. 

All such purposive conceptions of the 
law of effect are probably merely state- 
ments of the biological significance of 
learning. Selective learning is bound to 
be dominated by adaptive outcomes if 
the animal is left to his own devices in 
a maladaptive situation. He got the 
learning process through natural selec- 
tion and it would be a surprise if it did 
not turn out to be a bodily process 
which helped him adapt to his environ- 
ment. But survival-need reduction is 
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no more a cause of an individual in- 
stance of learning than is cooling of the 
body a cause of a person’s perspiring 
when hot. 

A more mechanistic, and therefore 
more acceptable, interpretation of ef- 
fect is the drive reduction hypothesis of 
Mowrer (13), Miller and Dollard (11), 
Hull (8), and others in the S-R rein- 
forcement group. There are numerous 
difficulties in Hull’s use of “need reduc- 
tion,” but when he and his followers 
speak of “drive reduction”—or still 
more tangibly, of “stimulus-intensity re- 
duction”—we have a principle which 
can predict that most learning will be 
adaptive but also can account for such 
examples as the maladaptive drug ad- 
dict who acquires a habit which is ad- 
justive but not adaptive. 

But a good mechanist is also not 
going to ignore the Smith and Guthrie 
(21) hypothesis that effect is removal 
from the situation. If one is endowed 
with a variety of adaptive reflexes, what 
could be more adaptive, when next 
thrown into a previously experienced 
maladaptive situation, than to do what- 
ever got you out of it the last time you 
were in it? The response that changes 
a maladaptive situation into an adap- 
tive one is inevitably going to be a wise 
choice from an adaptive standpoint— 
and a last-response factor may well have 
been incorporated into the learning 
process by natural selection. 

The Smith and Guthrie hypothesis— 
which they deduce from contiguity—is 
a hard one to test experimentally and it 
has survived wituout real disproof for 
about thirty years. It has the advan- 
tage over the drive reduction position in 
that it accepts contiguity as sufficient 
and therefore does not have the awk- 
wardness of the drive reduction position 
in coping with the conditioning of such 
responses as the knee-jerk, vaso-dilation, 
salivation based on acid, and so forth. 
Moreover, the drive red:iction position 


Frep D. SHEFFIELD 


is faced with problems even in its home 
territory of accounting for instances of 
selective learning where the “reward” is 
not drive reducing in any obvious way. 

Thus in a collaborative study with 
Dr. Thornton Roby (17) we found that 
hungry rats would learn responses for 
which reward was only a very sweet 
solution of saccharine and water. Hun- 
gry rats gorge themselves on such a 
non-nourishing solution and even after 
weeks of acquiring sophistication in this 
form of sham-feeding they readily learn 
to dash to the correct side of a T-maze 
to get a few minutes at the saccharine 
bottle. 

In some other experiments in collabo- 
ration with Jepson Wulff and Robert 
Backer (18) we found that male rats 
reared from infancy in segregation from 
females readily learned an instrumental 
response that led to a chance to copulate 
with a female in heat even though copu- 
lation was always stopped short of 
ejaculation. That is, the “reward” for 
selective learning was an opportunity 
to initiate copulation but the female 
was always removed considerably be- 
fore ejaculation. By some standards the 
“reward” would be called sexual frustra- 
tion. The animals had never ejaculated 
before the experiment so acquired drive 
reduction was ruled out; and they did 
not ejaculate during the experiment so 
primary drive reduction was ruled out. 
Yet the response instrumental in pro- 
ducing the female in heat showed steady 
acquisition compared with controls re- 
ceiving only a male companion in the 
geal box. 

These findings seem contrary to the 
drive-reduction hypothesis of the na- 
ture of “effect,” although they might 
be thought to follow the beneception 
mechanism of Troland (22). Perhaps 
the proper generalization is that the 
common factor is the execution of an 
innate and prepotent consummatory re- 
sponse, regardless of whether it reduces 
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the drive state. Whatever the correct 
inductive generalization, the results are 
more in line with a Guthrian contiguity 
analysis of selective learning than they 
are in line with a Hullian drive-reduc- 
tion analysis. According to Guthrie the 
common factor in the empirical list of 
rewards is that they radically change 
the environment from just before to 
just after reward. The kinds of changes 
Guthrie (5) considers important indi- 
cate that it should be unimportant 
whether or not the animal’s drive is re- 
duced so long as the consummatory be- 
havior remains prepotent and therefore 
able to take the animal “out of the 
situation.” 

I cite these experimental findings 
more to prove the fluid stage of our 
knowledge about reinforcement than to 
bolster the Guthrian contiguity analysis. 
My opinion is that Guthrie is still a 
long way from exhausting the implica- 
tions of contiguity for instrumental 
learning. In the meantime my hunch 
is that we will eventually be forced to 
conclude that rewards are critical only 
as the means of motivating practice and 
performance and that the contiguity 
factor will turn out to be sufficient in 
describing the effects of practice per se. 
In terms of the main topic of our sym- 
posium my chief argument is that dis- 
proof of the universality of the aw of 
effect does not force us into dualism as 
the only alternative—at least not until 
an all-out modern attempt has been 
made to see if association by contiguity 
is the common process both in selective 
learning and conditioning. 
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REFLECTIONS AND CONFESSIONS OF A 
REINFORCEMENT THEORIST * 


BY HOWARD H. KENDLER 
New York University 


During the last two decades it has 
become customary for some psycholo- 
gist, every so often, to take a pot shot 
at the reinforcement principle. These 
attacks have usually taken the form of 
an analysis of certain data followed by 
the inescapable conclusion that the re- 
inforcement principle is untenable. 

The reinforcement psychologists, 
known for their passionate devotion to 
their principles—especially their prin- 
ciple of reinforcement—have not re- 
sponded to these onslaughts in any- 
thing resembling an affable and gracious 
manner. These assaults upon the au- 
thority of the reinforcement principle 
have instigated vigorous and aggressive 
counter-attacks. About a year follow- 
ing the appearance of the “uncalled for” 
criticism, a series of articles begin to 
appear. Some member of the rein- 
forcement army reports new evidence 
which definitely refutes some non-rein- 
forcement formulation. Another mem- 
ber reanalyzes the data initially offered 
in opposition to reinforcement theory 
and discovers to his surprise that the 
data, rather than being contradictory, 
really confirms reinforcement theory. 
While all of this is taking place another 
article—or perhaps a book—is pub- 

1The polemical nature of this paper stems 
mainly from what the author considered to be 
the “social expectancies” of a paper delivered 
at an APA symposium on a very controversial 
topic. Rewriting the paper for publication 
would have required extensive revisions which 
would have been at odds with the agreement 
of the members of the symposium—to pub- 
lish essentially what they said at the meeting. 
If the above paper had been “rewritten for 
publication” it would have said the same 
thing, except in a much more guarded and 
dispassionate manner. 


lished which modifies some aspect of 
S-R reinforcement theory in order to 
increase its explanatory powers. 

Now this attack by non-reinforcement 
psychologists, followed by the vigorous 
defense of the reinforcement behavior 
theorists, has occurred time and time 
again. This routine, which at times has 
become exceedingly dull, is not going 
to be repeated here. Rather than be- 
come defensive about S-R reinforcement 
theory, permit me to become somewhat 
offensive about other formulations. 

The APA meeting is certainly not an 
appropriate place for a sermon, but if 
I were to give a sermon, I would use as 
my text the following quotation from 
page 36 of Chapter V of the book On 
Understanding Science by President 
Conant of Harvard University: 


“We can put it down as one of the prin- 
ciples learned from the history of science 
that a theory is only overthrown by a bet- 
ter theory, never merely by contradictory 
facts.” 


The question which I wish to raise is 
whether there is at the present time a 
more adequate theory of learning than 
the S-R reinforcement theory of Hull, 
Spence, Miller and their co-workers? 
Or to phrase the question somewhat 
differently: What have other theories to 
offer in competition to a theory that has 
made serious, and many successful, at- 
tenpts at explaining the data in the 
fields of the acquisition and extinction 
of conditioned responses (12), discrimi- 
nation learning (26), maze learning 
(8), human rote learning (13), trans- 
position (27), response fixation (6), so- 
cial learning (21), motor skills (1), dis- 
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tribution of practice (15), learnable 
drives (20), psychotherapy (5), etc.? 

So many of the non-reinforcement 
psychologists have been so obsessed 
with the need to criticize the reinforce- 
ment position that they have expended 
most of their intellectual resources on 
such criticism while ignering the far 
more important task of developing the 
explanatory capacities of their own for- 
mulations. If it has been the intent of 
these critics to weaken the reinforce- 
ment position, the fruits of their labor 
have betrayed their motivation. This 
constant barrage of criticism leveled at 
the reinforcement position has been one 
of its main sources of vitality and pro- 
ductivity. 

But let us now return to those theo- 
ries which are offered in competition to 
the S-R reinforcement formulation and 
let us remember “that a theory is only 
overthrown by a better theory, never 
merely by contradictory facts.” 

The first formulation I wish to evalu- 
ate, and frankly the one most vulnerable 
to critical analysis, is the formulation 
hinted at by Maier and Schneirla in 
their book (18) published in 1935, ex- 
panded in their article (19) published 
in 1942, and modified by Birch and Bit- 
terman (2) in 1950. Is this a “better 
theory” than the S-R reinforcement for- 
mulation? Conant (3) suggests that 
we should evaluate an attempt at theory 
construction both in terms of its suc- 
cess or failure in stimulating experimen- 
tation and in terms of its success or 
failure in explaining the known facts. 
In terms of the former requirement the 
evidence—or rather the lack of evi- 
dence—is rather convincing. I am not 
aware of one single bit of research which 
was stimulated directly by the Maier- 
Schneirla-Birch-Bitterman two factor 
learning theory. By chance alone one 
would expect some research to be insti- 
gated by an idea which had its concep- 
tion fifteen years ago and its birth eight 
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years ago. The sterility of this par- 
ticular two factor theory can certainly 
not be attributed to the lack of energy 
of its authors. Dr. Bitterman has pre- 
sented an experimental paper at sev- 
eral of the meetings I have attended in 
the past few years. All of them were 
designed to test some implication of the 
S-R reinforcement theory. Why, may 
I ask, did he not attempt to test some 
implication of the sensory integration 
principle? I believe this question can be 
answered by an analysis of the Maier- 
Schneirla-Birch-Bitterman formulation. 

Maier and Schneirla in 1942 insisted 
that there were two qualitatively differ- 
ent forms of learning. They wrote: 


“ .. in the first stage of the conditioning 
procedure a neutral stimulus develops ex- 
citatory value for a response it previously 
did not control. The essential condition 
for this change in the animal is contiguity 
between the experience of the neutral 
stimulus and the experience of the uncon- 
ditioned stimulus. Then a second stage 
may develop. Once the neutral stimulus 
has become a conditioned stimulus, the re- 
sponse it elicits may be rewarded or pun- 
ished” (19, p. 131). 


That is, in the classical conditioning 
set-up, sheer contiguity of the condi- 
tioned and unconditioned stimuli is the 
necessary condition for establishing the 
initial associative connection, with re- 
ward playing no role. Following this 
initial stage, the second stage may de- 
velop. Once the response anticipates 
the unconditioned stimulus, selective 
learning takes place. Reward then 
functions and serves to strengthen the 
association and perhaps modify the re- 


sponse. 

Dr. Underwood and I (14) examined 
the experimental data offered by Maier 
and Schneirla in support of their postu- 
lating an initial contiguity stage of 
learning. We concluded, like good re- 
inforcement theorists, that the data of- 
fered by them could be integrated within 
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S-R reinforcement theory. I believe we 
gave valid reasons. We also went one 
step further. We suggested some ex- 
perimental tests of their formulations. 
Perhaps this is not the proper thing to 
do. Birch and Bitterman (2) at least 
did not think so since they dismissed our 
proposal, as being “irrelevant” in spite 
of the fact that they were based upon 
Maier and Schneirla’s explicit statement 
that there were two distinct stages of 
conditioning. Actually the reason these 
proposals were considered irrelevant was 
due to the fact that Birch and Bitter- 
man (2) in replying to Underwood and 
my criticism of Maier and Schneirla’s 
two stage theory of learning surrepti- 
tiously substitute the term “process” 
for “stage” and state that the two proc- 
esses (contiguity and reinforcement) 
“are not situationally defined.” They 
also state that “under certain conditions 
both (contiguity and reinforcement) 
must be assumed to occur.” 

What does this sort of discussion 
mean? It means that by not defining 
these “processes” situationally Birch 
and Bitterman have effectively pro- 
tected their formulation against eny 
possible empirical attack. It means 
that, by their failure to provide these 
experimental specifications, combined 
with their assumption that both proc- 
esses of learning can occur simultane- 
ously, they have created an illusory for- 
mulation that seemingly can incorporate 
all the discordant learning data. Actu- 
ally, their formulation explains nothing. 

This lack of contact between the con- 
cept of sensory integration and inde- 
pendent manipulatable variables is not 
surprising when we consider the type of 
logic underlying its proposal. After re- 
jecting their personal interpretation of 
Hullian theory, Birch and Bitterman 
conclude that “there is no alternative 
but to postulate a process of sensory 
integration to account for stimulus sub- 
stitution.” This type of formulation 
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may be aptly described as “negative 
theorizing,” i.c., the validity of one 
viewpoint is demonstrated by the fal- 
laciousness of another viewpoint. 

What surprises me most is Birch and 
Bitterman’s dogmatic insistence that 
“there is no alternative but to postulate 
a process of sensory integration . . .” 
when they so willingly. admit “. . . we 
know very little about the conditions 
under which sensory integration occurs.” 

It seems to me, and I am now speaking 
as a clinician, that the term “sensory 
integration” must have rich connotative 
meanings to Birch and Bitterman in 
spite of obvious lack of objective ref- 
erences. These connotative meanings as 
I have attempted to demonstrate have 
neither heuristic nor explanatory value. 
I fear Birch and Bitterman have com- 
mitted an error which is common to 
many physiologically oriented psycholo- 
gists. They have failed to appreciate 
the fact that physiological speculation 
is not equivalent to scientific explana- 
tion. Adequate explanatory systems, in- 
volving either phvsiological or behav- 
ioral mechanisms, must have deductive 
components (7). 

Let us now turn our attention to the 
two other points of view represented in 
this symposium. Both possess a great 
deal more substance than shadow. It 
would be most beneficial to psychology 
if both theoretical formulations were 
more fully developed so that we could 
adequately estimate their true poten- 
tialities. To be quite frank, in spite of 
my own stated preferences, I feel that 
a disproportionate amount of research 
and theorizing has been done within the 
reinforcement framework. At our rather 
primitive state of development it would 
be unfortunate if we directed most of 
our energies in one direction. I might 
add, nowever, that we obviously cannot 
place the blame for the present “un- 
equal distribution” upon the shoulders 
of the reinforcement psychologists. 
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My major objection to Mowrer’s two 
factor theory (22) is not based upon 
any inherent aversion to two factors. 
To be quite truthful, I have no valid 
objection to even three basic factors— 
or four—or five—or even as many as 
Tolman would like to have. But what 
I do object to is that all the multiple 
process theorists have failed to come to 
grips with the positive construction as- 
pects of their theorizing. Their formu- 
lations have essentially developed from 
negative notions, i.e., a single factor 
contiguity or effect theory is not ade- 
quate. No amount of criticism of other 
formulations can substitute for the very 
difficult and laborious task of develop- 
ing a logical, meaningful theory. The 
problem of sharply distinguishing be- 
tween two major factors in learning is a 
difficult one. It must be done in such 
a way that it is possible to ascertain 
the influence—or lack of influence—of 
each factor at any one time. Both 
mechanisms must be adequately an- 
chored to experimental variables. If 
both mechanisms are assumed to op- 
erate simultaneously—and this certainly 
can be a legitimate assumption—it be- 
comes necessary to specify in what 
manner each mechanism contributes to 
the behavior pattern which is observed. 
If the above requirements are not met, 
multiple process formulations will be 
nothing more than a group of theoretical 
escape clauses which, in any situation, 
can be conveniently invoked. 

Mowrer (22) believes that condi- 
tioned responses involving skeletal 
muscle activity require reinforcement 
but contiguity is a sufficient condition 
for emotional conditioning involving re- 
sponses controlled by the autonomic 
nervous system. Although this division 
at first glance appears to be clear, 
confusion arises when an experimental 
situation involving both skeletal and au- 
tonomic responses is analyzed. I be- 
lieve the experiment by Mowrer and 


Suter (with which I am familiar only 
through secondary sources) demon- 
strates this basic difficulty. 

It may be recalled that these experi- 
menters trained two-groups of rats in a 
situation in which shock served as the 
unconditioned stimulus. For one group 
the conditioned stimulus ended with the 
onset of the shock while for the second 
group the conditioned stimulus was 
terminated at the end of the shock. 
Mowrer and Suter reasoned that ac- 
cording to reinforcement theory the 
second group should evidence greater 
fear. This would follow, according to 
them, because the conditioned stimulus 
persisted, for the second group, until 
reinforcement occurred. Since no dif- 
ference between the groups was ob- 
tained, Mowrer and Suter concluded 
that contiguity alone could account for 
conditioned fear. As Dr. Sheffield has 
indicated, the experimental operations 
do not adequately reveal what responses 
are occurring and what responses are 
receiving reward. Dr. Miller, in an un- 
published article (20), suggests another 
interpretation: 


“Tt is possible that the crucial thing that 
had to be reinforced by drive-reduction 
was the response of starting to be afraid at 
the onset of the conditioned stimulus and 
that this response is different from the one 
of maintaining an already elicited fear 
while the conditioned stimulus persists. If 
this should be true, the experiment would 
not be a crucial test because the interval 
between starting to be afraid at the onset 
of the conditioned stimulus and the drive- 
reduction produced by the turning off of 
the shock was the same for both groups.” 


Although the results of the Mowrer and 
Suter experiment have been presented as 
being embarrassing to a single factor re- 
inforcement theory, I believe it is more 
of a challenge to Mowrer to sharpen 
the empirical operations dividing his 
proposed two factors. 
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And now for some brief comments 
about the single factor contiguity theory. 
I find it very difficult to disagree with 
most of the comments made by Dr. 
Sheffield. This is not meant as an at- 
tempt at reciprocating for the able de- 
fense of reinforcement theory offered by 
Dr. Sheffield at several points in his 
paper (24). The truth of the matter is 
that if I were to abandon the reinforce- 
ment position I believe I would adopt 
the single factor contiguity position. I, 
too, am very impressed with the simi- 
larities between the so-called different 
types of learning. In addition, my pref- 
erence is for a single factor theory be- 
cause of the complicated logical problem 
which develops when multiple processes 
are assumed. 

Implicit in Sheffield’s request that an 
“all out modern attemot” be made to 
develop a single factor contiguity theory, 
is the realization that the contiguity 
“theory” at present is not a theory, but 
really a hope for one. In spite of the 
ingenuity of Guthrie’s writings (10) its 
empirical productivity has been practi- 
cally nil. One may be tempted to ex- 
plain this on the basis that the scenic 
northwest is not the proper environ- 
ment for serious system building. I be- 
lieve a careful reading of Guthrie will 
indicate that there may be other rea- 
sons, in addition to the geographical 
ones, which have permitted contiguity 
theory to remain sterile for so long. 
_, Guthrie writes: 


“he principle of conditioning is not an 
explanation of any instance of learning. It 
is mereiy a blank form which explanations 
of instances of learning may take. It ex- 
presses only tnc conviction that we may 
profitably look for the signs of a move- 
ment along those stimuli which are now 
present and which were once before pres- 
ent with the movement.” * 


2 The importance of this passage was pointed 
out to the author by Professor Sigmund Koch. 
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It seems to me that Guthrie adopted 
an extreme positivistic viewpoint which 
is basically antithetical to theory con- 
struction. By refusing to consider con- 
tiguity as a general explanatory mecha- 
nism the search for sets of relationships 
between the mechanism of contiguity 
and stimulus and response variables was 
effectively stymied. I hope that by a 
“modern attempt” Sheffield is referring 
to the deductive type of theory initially 
espoused by Tolman (28). I think if 
he does, the psychology of learning will 
be greatly benefited by his undertaking. 

I do not wish to leave you with the 
impression that the general undeveloped 
nature of the formulations discussed is 
their only drawback. I feel that there 
is some empirical evidence which is 
somewhat embarrassing to them. How 
does Dr. Mowrer explain the failure of 
F_igard, Dutton, and Helmick (11) to 
condition the autonomic pupillary re- 
sponse? How would Dr. Sheffield ex- 
plain the results of Loucks (16) and 
Loucks and Gantt (17) who report that 
contiguity alone was not sufficient for 
conditioning to occur? Only when re- 
inforcement, in terms of either food or 
shock reduction, was present were these 
experimenters able to obtain condition- 
ing. Could Drs. Birch and Bitterman 
provide any reasonable explanation of 
the work of Grice (9) and Kimble (15) 
which fits so snugly into S-R theory? 
Or could they even give us an adequate 
explanation of why sensory precondi- 
tiouing occurred in some situations and 
not in others? 

In closing I would like to make some 
confessions. Firstly, I am mot a rein- 
forcement theorist because I am con- 
vinced that reinforcement is necessary 
for learning. I am a reinforcement 
theorist purely for pragmatic reasons. 
During the past twenty years, reinforce- 
ment theory has been the most fruitful 
theory both in terms of its ability to 
instigate experimental research and in 
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terms of its ability to integrate various 
phenomena within its theoretical struc- 
ture. For these reasons alone I feel 
justified in conducting my experimental 
work within the framework of S-R re- 
inforcement theory. 

The above does not imply that I am 
completely satisfied with the status of 
reinforcement theory. To a large ex- 
tent the use of the term theory is a 
compliment that it does not sufficiently 
deserve. In spite of its relative sophisti- 
cation, S-R reinforcement theory is still 
basically a primitive formulation. It 
would be unwise for S-R psychologists 
to try to “sell it” for anything more 
than that. It would be also unwise for 
S-R reinforcement psychologists to de- 
velop a “party line mentality” which 
would in any way discourage experi- 
mental work within other . theoretical 
systems. The history of science is filled 
with examples in which a useful concept 
served as a barrier to the acceptance of 
a better one. 

I also feel that there has been a dis- 
proportionate concern with the concept 
of reinforcement aid specific theories of 
reinforcement. This condition can be, 
however, largely attributed to the critics 
of reinforcement theory. Certainly the 
data in psychology are not going to col- 
iapse to the principle of reinforcement 
alone. Rather than worry whether re- 
inforcement is basically “need reduc- 
tion” or some other type of mechanism 
I believe it would be more strategic to 
accept a general reinforcement theory 
which does not specify the underlying 
mechanism. When our motivational 
theory becomes more adequate (and 
great strides have been made in recent 
years), then some of our present prob- 
lems concerning reinforcement will be 
clarified. 

I do not want my emphasis of Con- 
ant’s statement that “a theory is only 
overthrown by a better theory, never 
merely by contradictory facts” to be 
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interpreted as a tacit acceptance of the 
existence of experimental data which are 
contradictory to effect theory. It is in- 
teresting to note that the major findings 
which have been offered in opposition to 
reinforcement theory consist of simple 
demonstrations of certain phenomena. 
An 2xample of this is the phenomenon 
of sensory pre-conditioning. We know 
very little about this phenomenon. How 
is it modified by temporal relationships? 
How is it modified by the intensity of 
the stimuli? What.responses are oc- 
curring during the pairing of the two 
stimuli? When these questions are 
answered I would not be surprised to 
find sensory pre-conditioning data fitting 
neatly into reinforcement theory. 

The same situation existed in regard 
to the problem of partial reinforcement. 
After this phenomenon was systemati- 
cally investigated, the results appear to 
be more easily integrated within S-R 
than within cognitive theory. Rubin 
(23), in work done at New York Uni- 
versity, demonstrated greater resistance 
to experimental extinction by a group 
of animals reinforced 100 per cent as 
compared to a 50 per cent reinforced 
group. In his study, distributed train- 
ing was used and secondary reinforce- 
ment was controlled. These data, of 
course, are quite consiste~: with the re- 
cent results reported by Virginia Shef- 
field (25) and Denny (4). 

In closing, perhaps it would be best 
to repeat my major point. In the pres- 
ent competition between various learn- 
ing theories, S-R reinforcement theory 
is far ahead in the race. May I extend 
a challenge and invitation to other 
theories to catch up? 
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COMMENTS ON MULTIPLE-PROCESS CONCEPTIONS 
OF LEARNING 


BY NEAL E. MILLER 
Yale University - 


When a leading exponent of one hy- 
pothesis becomes converted to another, 
the fact that he has been moved to con- 
fess the error of his former ways car- 
ries great conviction. The experimental 
evidence presented in Mowrer’s paper 
(15), however, does not carry similar 
conviction. He cites Gantt’s (3) ob- 
servation that a conditioned change in 
the heartbeat regularly precedes any 
conditioned instrumental response. This 
certainly is congruent with the hypothe- 
sis that an autonomic response, such as 
fear, is learned first and serves as the 
drive to motivate the instrumental re- 
sponse. But it does not prove that the 
autonomic response is acquired by a 
different type of learning—conditioned 
by contiguity instead of being rein- 
forced by drive-reduction. The other 
experimental evidence, the study by 
Mowrer and Suter (16, pp. 275-283), 
has already been shown by both Kendler 
(7) and Sheffield (17) to contain seri- 
ous flaws which make it inconclusive. 
Moreover, Mowrer (15) himself admits 
that the results are not crucial. This 
leaves us with only a hypothetical ex- 
periment to test a deduction from the 
drive-reduction hypothesis and the as- 
sertion: “. . . it would be rather amaz- 
ing if the deduction were confirmed.” 

In view of the extreme weakness of 
this experimental evidence, I venture to 
suggest that another factor may have 
played a major role. In order to stick 
to a consistent drive-reduction theory 
of reinforcement, one is forced to as- 
sume that the reinforcement for learn- 
ing to fear a painful event is not the 
onset of the pain, but its reduction. 
This seems to be a direct contradiction 


of all that is reasonable. It is to 
Mowrer’s great credit that he has tried 
to formulate a consistent theory and 
thus faced this problem squarely. I 
agree that this application of the prin- 
ciple of drive-reduction goes against 
common sense; I feel uncomfortable 
every time I am forced to make it (2, 
12, 13). Once in a great while, how- 
ever, when common sense is tested, it 
proves to be wrong: an utterly unrea- 
sonable assumption, like a ray of light 
being curved by gravity, or a light stone 
falling as fast as a heavy one, turns out 
to be correct. That is the reason for 
stubbornly insisting on conclusive ex- 
perimental evidence. 

Let us return to Mowrer’s (15) hypo- 
thetical experiment. According to a 
strict drive-reduction theory of rein- 
forcement, the end of the noxious stimu- 
lus must be a reinforcement. There- 
fore, if the signal occurs immediately 
before a noxious stimulus, more fear 
should be learned if the noxious stimu- 
lus is brief than if it is long. This fol- 
lows because (a) the interval between 
the signal and the reinforcing termina- 
tion of the noxious stimulus will be 
lengthened if that stimulus is prolonged, 
and (b) longer intervals are less favor- 
able to reinforcement. Therefore, other 
things equal, a signal followed by a brief 
noxious stimulus should acquire the ca- 
pacity to elicit stronger fear than one 
followed by a prolonged noxious stimu- 
lus.* 


1 Actually the results might be obscured by 
two complicating factors: (1) the intensity of 
the painful stimulation might be increased 
enough by temporal summation so that the 
greater amount of reduction when it ceases 
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This unreasonable deduction is par- 
ticularly interesting because it happens 
to have been tested recently in an un- 
published experiment by Sullivan (22). 
Although Mowrer had no way of know- 
ing it, the experiment he described is 
real rather than hypothetical. Sullivan 
used a change in a pattern of lights as 
the CS, a 90 decibel, 1,000 cycle tone 
as the US, and recorded the galvanic re- 
sponse of human subjects. For Group 
1 the change in the pattern of lights was 
followed by a brief piercing tone (.75”). 
For Group 2 it was followed by a longer 
piercing tone (4.0”). Both the fre- 
quency (p=.05) and the amplitude 
(p = .06) of conditioned responses to 
the light were greater for the group 
having a brief, loud tone as the US. 
Furthermore, although there was no re- 
liable relationship between the ampli- 
tude of the UR and the frequency of 
the CR, there was a relationship within 
each group between the rate of decre- 
ment of the UR and the frequency of 
the CR (p= .14 for Group 1 and .03 
for Group 2). Thus, amazingly enough, 
the unreasonable deductions from a 
strict stimulus-reduction theory of rein- 
forcement seem to be confirmed. This 
experiment should be repeated, how- 
ever, with a control for the possibility 
that the responses of Group 2 were de- 
pressed by a greater amount of negative 
adaptation which may have occurred 
during the presentations of the longer 
tone. 

In another of the papers in this sym- 
posium, Birch and Bitterman (1) have 
argued that the facts of evolution, the 


more than compensates for the delay in the 
reduction, or (2) the effect of delaying the 
offset of the stimulus might be greatly diluted 
if a major part of the reinforcement should 
come from an immediate (perhaps transitory) 
reduction produced by adaptation (13, p. 35) 
immediately after the on-effect. Therefore, it 
would be desirable to try to secure electrical 
recordings, preferably centrally, of the effec- 
tive strength of stimulation. 
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diversity of animal species, mean that 
there must be different mechanisms of 
learning. This does not necessarily fol- 
low. Birch and Bitterman could just as 
well argue that the obvious differences 
between the bodies of Rita Hayworth 
and an artichoke mean that it is ridicu- 
lous to assume that the same funda- 
mental mechanism of cell division by 
mitosis is involved. 

I agree completely with Birch and 
Bitterman that great differences in the 
intellectual capacity of individuals and 
species pose an important problem which 
has been largely neglected. But these 
differences in intellectual capacity do 
not necessarily imply differences in the 
basic mechanism of learning. Insight 
and reasoning may turn out to be mat- 
ters of performance, recombining and 
using habits rather than fixating them. 
In our present state of ignorance about 
the sources of intellectual differences, 
we cannot be certain whether or not 
they involve different types of learning. 

To take an extremely simple example, 
the mechanism of a calculator with ten 
banks of keys and twenty of recording 
dials may be made out of exactly the 
same kinds of parts functioning accord- 
ing to exactly the same principles as 
that of one with only two banks of keys 
and dials. But one can use the 20-dial 
machine to get simultaneously all of the 
constants for computing a correlation 
(=X, SY, =X?, SY’, S2XY), while one 
could not do this with the 2-dial one. 
Mere replication of identical functional 
units allows one to perform new opera- 
tions; it increases the capacity of the 
machine. 

Intellectual differences could be pro- 
duced in still other ways without in- 
volving any new types of learning. The 
sense organs or central analyzers might 
be constructed so that a larger number 
of complex patterns or relationships 
would have distinctive cue value. The 
transient reverberating traces of cues 
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might last much longer. A greater num- 
ber of distinctive cue-producing re- 
sponses might be available. These 
would be differences in performance 
rather than learning. 

Differences in the capacity for per- 
formance may influence the course of 
learning without necessarily involving 
different mechanisms of learning. Thus 
a seal is more likely to be rewarded for 
catching fish and a swallow for catch- 
ing flies. But it seems likely that the 
greater capacity of the seal to learn 
to solve underwater problems and the 
swallow to solve aerial ones depends on 
the difference between flippers and 
wings rather than on basic differences 
in the mechanism of learning. 

I emphasize this point because some 
of our controversies may arise from 
looseness in the definition of learning. 
Thus Birch and Bitterman (1) seem to 
use the terms “learning” and “behav- 
ioral modification” synonymously. This 
enables them to bring in the example of 
the paramecium rendered more flexible 
by immersion in a solvent—a phenome- 
non which presumably is no more an 
example of learning than is the tanning 
of the skin on exposure to sunlight. 

To change the subject, the brief 
sketch by Birch and Bitterman (1) of 
a new theory of the elimination of blinds 
is good because it starts to go into some 
of the details. I certainly hope that 
they continue to develop it with con- 
siderably more rigor and detail. Then 
we will have some basis for comparing 
it with Hull’s (5) theory. 

Paradoxically, a prospectus often 
seems more plausible than a more fully 
developed theory. This is because the 
difficulties become apparent only after 
one attempts to go into rigorous detail. 
For example, Birch and Bitterman (1) 
suggest that the facts of secondary rein- 
forcement can be derived from a prin- 
ciple of sensory integration, established 
directly by contiguity and not depend- 
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ent on drive decrement. This sounds 
plausible. But, on second thought, I 
believe it. would force one to predict 
good “latent learning” in experiments, 
such as the one by Kendler and Mencher 
(8), which have conspicuously failed to 
show such learning. One would also 
deduce that the procedure of pairing a 
neutral stimulus with the presentation 
of food would be just as effective in 
establishing secondary reinforcement 
when the animal was satiated as when 
it was hungry. 

The topic of latent learning brings up 
the claim of Birch and Bitterman (1) 
that the reinforcement theorists have 
pointedly ignored this topic because it 
is embarrassing to their theory. But 
stimulus-response psychologists have not 
neglected “latent learning.” Instead of 
motivating the conspiracy of silence 
which Birch and Bittermann allege, re- 
inforcement theory has pointed the way 
toward a theoretical and experimental 
analysis of so-called latent learning. 


Studies like those of Tolman and © 


Honzik (24) in complex mazes have 
been criticized as involving a large num- 
ber of complex motivational factors such 
as reward from return to the familiar - 
home cage, loss of fear, a possible exer- 
cise or exploratory drive, preferences for 
long unobstructed runs, aversion to be- 
ing blocked at the end of blind alleys, 
etc. Thus the results of these experi- 
ments show an obvious improvement 
before the experimental group is re- 
warded and in the control group. 
Because of the difficulties in control- 

ling these factors in complex mazes, ex- 
periments have been performed in very 
simple T-maze situations. The general 
conclusion from these studies is that 
the little “latent learning” which some- 
times occurs is a great deal weaker than 
the learning when drive and reward are 
known to be present. In short, “latent 
learning” is so relatively weak that it is 
much more plausible to assume that it 
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is produced by uncontrolled sources of 
motivation and reward than to assume 
that it represents a fundamental mecha- 
nism of learning (10). Thus the fact 
that a small amount of so-called latent 
learning sometimes occurs is not em- 
barrassing to the reinforcement theo- 
rists; the fact that it is so weak should 
be extremely embarrassing to those who 
claim that all learning involves cognition 
or sensory integration established with- 
out reinforcement. 

While competing theories have done 
little more than point to the fact of 
“latent learning” (ignoring its uncer- 
tainty and weakness), an analysis in 
terms of reinforcement theory has 
pointed to some of the variables deter- 
mining whether or not it will occur. 
Meyers (11) has demonstrated that the 
sight of food has secondary reinforcing 
value even for satiated animals. Spence 
and his colleagues (21) have shown how 
fractional anticipatory goal responses 
based on secondary reinforcement could 
mediate “latent learning.” They have 
made the reasonable assumption that, 
when an animal is given preliminary 
training in a maze containing an ir- 
relevant goal object (i.e., food when he 
is thirsty but not hungry) and a rele- 
vant one, there is conflict between the 
weak anticipatory goal response aroused 
by the irrelevant object and the strong 
one aroused by the relevant object. On 
the basis of this assumption they have 
deduced that the anticipatory goal re- 
sponse to the irrelevant goal object 
(and hence “latent learning”) should 
be interfered with (1) the closer the 
relevant and irrelevant goal objects are 
in time or space, and (2) the stronger 
the drive for the relevant goal object. 
They have secured experimental evi- 
dence tending to support these deduc- 
tions. 

Similarly, as has already been pointed 
out (7, 17), those who make sensory in- 
tegration or other purely cognitive prin- 


ciples the major factors in learning 
should be far more embarrassed by the 
extreme weakness and: uncertainty of 
sensory preconditioning than the rein- 
forcement theorists are by the fact that 
a little of it sometimes occurs. What 
we need is a theoretical and experi- 
mental analysis which will allow us to 
predict when sensory preconditioning 
will occur and when it will not. It will 
be interesting to see ~shether this analy- 
sis is produced by the reinforcement 
theorists or by their critics. 

In dealing with the topic of sensory 
preconditioning Birch and Bitterman 
(1) raise the old centralist-peripheralist 
controversy. Though most S-R psy- 
chologists have been peripheralists, there 
is no logical necessity for this. One can 
assume that there are central responses 
which obey exactly the same laws as 
peripheral ones and produce stimuli 
which function exactly like those of ex- 
ternal origin (2, 12, 13). The admis- 
sion of central processes, such as per- 
ceptual responses and images, does not 
necessarily imply a different mecha- 
nism of learning. Central responses 
might even require less drive and rein- 
forcement (since they are less effortful) 
and still be following the same laws as 
peripheral ones. 

The concept of a central response dif- 
fers from current cognitive theories in 
that such theories leave the laws of 
cognition vague while implying that 
they are different from those that the 
S-R psychologists apply to peripheral 
responses. For example, Tolman (23, 
p. 368) says with characteristic candor: 
“Wants and cognitions as interfused ac- 
tivities in the concrete apprehended be- 
havior-fields do, I believe, function ac- 
cording to neo-Gestalt laws (which laws 
have, of course, for the most part not 
yet been either precisely discovered or 
stated).” 

Kendler (7) has already pointed out 
some of the logical difficulties inherent 
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in multiple-factor theories. One must 
be able to distinguish each of the proc- 
esses from the others. Mowrer (15) 
has made a clear-cut distinction on 
anatomical grounds, with responses of 
the somatic nervous system reinforced 
by drive reduction and those of the au- 
tonomic conditioned by contiguity. But 
when he includes cognition and meaning 
among the processes conditioned by con- 
tiguity (15), one wonders if he will con- 
sistently relegate them to the autonomic 
nervous system. 

As Sheffield (17) has pointed out, the 
problem of making any distinction is 
complicated by the fact that the sup- 
posedly different types of learning seem 
to follow similar laws. To add to the 
difficulty, the definition of sensory inte- 
gration has an important ambiguity. 
Birch and Bitterman say: “. . . when 
two stimuli are presented contiguously, 
the first acquires some of the functional 
properties of the second.” Which prop- 
erties? Such a principle is impossible 
to disprove, and hence is meaningless. 
Negative results can always be dis- 
missed on the grounds that the particu- 
lar properties in question are not ac- 
quired. 

Contrary to some claims, an em- 
pirical definition of reinforcement is not 
completely ad hoc. The acquisition of 
a given habit is used to determine the 
fact that a certain event is reinforcing. 
This much is ad hoc. Then a predic- 
tion is made that the same event can be 
used to strengthen other 5-R connec- 
tions. This prediction could be either 
right or wrong; it obviously is not ad 
hoc. Definitions of this type have been 
elaborated elsewhere (2, 9, 13). 

Sheffield (17) is correct in pointing 
out that such definitions result in a long 
and unwieldy list and in emphasizing 
the desirability of trying to find the 
common elements in the events included 
in the list. He calls attention to Guth- 
rie’s (4) suggestion that reinforcements 
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are events that radically change the 
stimulus situation. I believe that this 
specification is too broad. Not all 
radical changes are reinforcing. For 
example, while the change from shock- 
on-the-grid to no-shock-on-the-grid will 
reinforce preceding responses, I do not 
believe the opposite change, from no 
shock to shock, will reinforce the re- 
sponses that precede it. But turning 
on the shock takes the rat out of the 
situation of lounging at ease on the 
grid. All of the proprioceptive and emo- 
tional cues from sprawling out relaxed 
are removed when the shock is turned 
on and the animal starts jumping 
around. I would think that Guthrie 
would have to predict that turning on 
the shock whenever the animal scratched 
his ear would teach him this response, 
since it would always be the last re- 
sponse made to the situation of relaxed 
sprawling. Guthrie might argue that 
the jumping responses elicited by the 
shock are incompatible with scratching 
the ear. But the responses of ducking 
down to the food dish and eating are 
equally incompatible with pressing a 
bar above the food dish—indeed they 
sometimes interfere as anticipatory er- 
rors. Furthermore, ducking down to 
the food dish occurs as the last response 
before the situation is changed by eat- 
ing while jumping occurs after the 
situation is changed by electric shock. 
Nevertheless the rat learns to press the 
bar if the changes produced by the 
delivery of food are contingent on that 
act. Then why should the animal not 
learn to scratch his ear if the changes 
produced by electric shock are made 
contingent on that act? The fact is 
that the direction and the magnitude of 
the change seem to be irportant. 
Perhaps Guthrie’s (4) theory can deal 
with the problem raised by the fore- 
going example, but the logic has not yet 
been worked out in enough detail to be 
convincing. It is conceivable that this 
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problem might be dealt with by refin- 
ing Guthrie’s theory to state that the 
removal of strong stimuli (involving 
the discharge of large numbers of neural 
units per second) produces a great ef- 
fect while the removal of weak stimuli 
produces little effect. This would re- 
duce the difference between Guthrie’s 
theory and the version of drive-reduc- 
tion that defines a drive as a strong 
stimulus (13). 

It seems to me that in all of the cases 
(turning off strong shocks, bright lights, 
piercing sounds, etc.) that are most 
clear cut, reinforcement involves the re- 
duction in an intense stimulus, or drive. 
The exceptions—saccharin (18), sexual 
excitement (19), etc——seem to me to be 
complex cases in which it is more diffi- 
cult to determine with certainty whether 
or not drive-reduction is involved. 

Even in these difficult cases I believe 
there is a certain practical advantage in 
stubbornly sticking to drive-reduction, 
if only to infuriate someone into design- 
ing a superior theory backed by convinc- 
ing experiments. For example, those 
who find it easy to postulate multiple 
processes or do not believe in theory 
are not likely to be spurred into action 
by the Sheffield and Roby (18) experi- 
ment showing that drinking a saccharin 
solution is reinforcing. But a strict 
drive-reduction theory demands that the 
saccharin must produce its reinforcing 
effect by temporarily reducing hunger 
(or some other drive), even though it 
does not seem to serve any need. Fol- 
lowing this line of reasoning Mr. Kohn 
and I (14) have measured hunger by 
Skinner’s (20) technique of bar-press- 
ing reinforced at fixed temporal in- 
tervals, and have secured preliminary 
evidence that drinking a saccharin solu- 
tion reduces the rate of bar-pressing 
more than drinking the same volume of 
water. In short, the sweet taste seems 
to produce some immediate drive-reduc- 
tion. We are now in the process of 
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checking up on this more carefully by 
comparing the effects of saccharin in- 
gested by mouth with those of saccharin 
injected through a fistula directly into 
the stomach. 

Having emphasized the drive-reduc- 
tion hypothesis, I want to remind you 
that reinforcement theories of learning 
are not necessarily limited to that hy- 
pothesis. One can assume that there 
are other sources of reinforcement or 
be content with an empirical definition 
(2, p. 42). 

I have stressed the value of trying to 
stick to the most parsimonious theory 
that will account for the facts. In con- 
clusion I would like to point out that 
natural selection often operates on the 
“spare tire” principle rather than on 
parsimony. This may resu't in alterna- 
tive mechanisms that are basically dif- 
ferent. If this is true, the most par- 
simonious theory that will account for 
all of the facts may be quite complex. 
There is no objectivun to outlining pro- 
grams for the development of multiple- 
process or other theories of learning. 
But such outlines are no substitute for 
the hard work of constructing a rigorous 
theory. Until rival theories of learning 
are developed to at least the primitive 
level of current reinforcement theory, 
they are not likely to displace it. 
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CAN REINFORCEMENT THEORY ACCOUNT 
‘FOR AVOIDANCE? 


BY BENBOW F. RITCHIE 
The University of California 


How much of the S-R reinforcement 
hypothesis can be preserved in the face 
of evidence from (1!) the avoidance 
learning experiments, (2) the sensory 
pre-conditioning experiments, and (3) 
the latent-learning experiments? This 
is one way of stating the issue discussed 
in this symposium. 

The fact that the S-R reinforcement 
hypothesis has been precisely employed 
in many areas of behavior leads some 
psychologists (Kendler and Miller in 
this case) to consider the present ques- 
tion as unimportant. This attitude is 
expressed in the emphasis laid upon the 
small, although significant, effects found 
in the sensory pre-conditioning and 
latent-learning experiments. Does the 
smallness of these effects justify us in 
the conclusion that the question is 
trivial? 

A young man went to the father of 
his prospective bride with a question 
which he felt he must have answered. 
“Is Josie a good girl?” he asked. The 
girl’s father replied by listing all her 
good qualities: her marks in school 
were good, she was a good worker, she 
was popular, and so on. But the young 
man did not seem satisfied by these re- 
plies and repeated, “Yes. But is Josie 
a good girl?” “Well,” replied Josie’s 
father, “there is one thing. Josie is just 
the tiniest bit pregnant.” 

If any learning, no matter how tiny, 
occurs in the sensory pre-conditioning 
and latent learning experiments, then 
one basic assumption of the S-R rein- 
forcement hypothesis is false. It is not 
true that all learning consists in the 
strengthening of specific stimulus-re- 
sponse connections by reinforcement. 





The fact that Josie has many other good 
qualities does not make the present 
question a trivial one. 

Since most of the talk in this sym- 
posium concerns the implications of 
avoidance learning for the S-R rein- 
forcement hypothesis, I will restrict my 
discussion to this problem. The basic 
problem, as I see it, is this. What hap- 
pens to the reaction-potential of the 
avoidance response when the response 
antedates the shock and no shock oc- 
curs? 

Let us consider the experimental facts 
first, and then turn to the implications 
of each of the hypotheses presented in 
the symposium. There are many ex- 
perimental situations in which the la- 
tency of the response increases after 
the failure to get shock, and as a conse- 
quence performance becomes unstable. 
‘There is, however, at least one experi- 
ment in which this does not occur. In 
Solomon’s (3) experiment, there was, 
with normal dogs, no increase in la- 
tency, during 600 shockless trials. It 
is not true then that the ,E, of the 
avoidance response is always weakened 
by a shock-less trial. 

The hypothesis of Kendler and Un- 
derwood (2) states that reinforcement 
is provided by termination of shock. 
Without such reinforcement reactive- 
inhibition accumulates and reaction po- 
tential decreases. Sheffield’s contiguity 
hypothesis also depends upon reinforce- 
ment by shock. Reinforcement accord- 
ing to this hypothesis consists in the 
occurrence of an unconditioned stimu- 
lus for the learned response. Sheffield 
presents evidence that in Ais experi- 
ment, without such reinforcement, the 
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habit weakens. Thus neither of these 
hypotheses can account for the Solomon 
data without modification. 

The sensory-integration hypothesis 
does not make any specific assertions 
about how un-learning occurs. If, how- 
ever, it is asserted that un-learning re- 
sults from S, occurring without S2, then 
this hypothesis, too, would be unable to 
account for the Solomon data. 

Finally let us consider what happens 
to the reaction-potential, according to 
Mowrer’s and Miller’s hypotheses, on 
each shock-less trial? Two effects oc- 
cur: (1) the ,H, for the avoidance re- 
sponse is strengthened by the reduction 
in the learned fear drive, and (2) the 
sE, for the learned fear drive is weak- 
ened because it is not followed by shock. 
How will these two effects then affect 
the ,E, for the avoidance response on 
the next trial? Both Miller and Mowrer 
follow Hull in regarding ,E, as a multi- 
plicative function of ,H, and D. Thus 
the increase in ,H, for the avoidance re- 


sponse tends to increase the ,E, for that 
response. But what effect does the de- 
crease in the ,E, for the learned fear 
drive have? First it will decrease D 
(i.e., the magnitude, and probability of 
the fear response decrease, and its la- 


tency increases). As a consequence of 
the decrease in D, the',E, for the avoid- 
ance response will be decreased. Thus 
the two effects of a shock-less trial are 
opposed. Can the Solomon data be ac- 
counted for by the further assumption 
that these opposed effects are equal and 
thus cancel each other, maintaining the 
sE, for the avoidance response at a fixed 
level? I do not think that this assump- 
tion will solve the problem, if there is, 
as Hull says, a maximum value of ,H, 
for any habit. Once this maximum is 
reached, continuous decreases in D will 
be reflected in continuous decreases of 
the ,E, for the avoidance response. 
The shape of the acquisition curve dur- 
ing the first few trials in avoidance 
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learning makes it scarcely plausible to 
assume that the maximum for ,H, is not 
reached soon in Solomon’s experiment. 

Since in all four of the hypotheses we 
have considered there is the basic as- 
sumption that the maintenance of the 
avoidance habit depeads either directly 
or indirectly upon the contiguity of the 
conditioned stimulus and the shock, the 
Solomon data contradict all these hy- 
potheses. The questions of (1) single 
or dual processes, (2) stimulus-response 
or sensory integration, or (3) conti- 
guity or need-reduction, are all second- 
ary questions. The fundamental ques- 
tion is whether any simple association 
hypothesis which is designed to account 
for both learning and extinction in other 
kinds of learning situations is also ca- 
pable of accounting for the Solomon 
data. 

In defense of the Miller-Mowrer hy- 
pothesis one might further hypothesize 
that learned fear responses have differ- 
ent extinction curves from those which 
characterize all other kinds of responses. 
Miller, in his discussion of the present 
problem, has suggested that such fear 
responses extinguish very slowly. But 
if one is to support this claim with ex- 
perimental evidence, it is essential that 
the experimental criteria for the occur- 
rence of the fear response be distinct 
from the criteria for the occurrence of 
the instrumental avoidance response. 
To point to the failure of Solomon’s 
dogs to extinguish in 650 trials as evi- 
dence that fear responses extinguish 
more slowly than other kinds of re- 
sponses, is to beg the question com- 
pletely! For it is just as reasonable to 
conclude from this same evidence that 
instrumental avoidance responses, once 
acquired, do not depend for their main- 
tenance upon continued reinforcement. 
This second interpretation is, as we 
shall see, the one proposed by the ex- 
pectancy hypothesis. Finally, only if 
the advocates of the Miller-Mowrer hy- 
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pothesis can provide independent evi- 
dence concerning the extinction of fear 
responses, will their interpretation be 
preferable to the expectancy hypothesis. 

Mowrer’s suggestion that the fear re- 
sponse is a complex of visceral responses 
controlled by the sympathetic nervous 
system is a step in this direction. Could 
it be demonstrated (1) that such vis- 
ceral responses are necessary for the 
acquisition and maintenance of the 
avoidance response, and (2) that such 
visceral responses are excessively diffi- 
cult to extinguish, then the Miéiller- 
Mowrer hypothesis would be a satis- 
factory explanation of avoidance learn- 
ing. But Solomon’s evidence drawn 
from sympathectomized dogs shows that 
the first of these two conditions is not 
satisfied. Concerning the second condi- 
tion there is no evidence at present that 
the extinction rates of responses con- 
trolled by the sympathetic nervous sys- 
tem are at all different from those of 
responses controlled by the somatic 
nervous system. In the light of these 
considerations, then, it seems that avoid- 
ance learning still poses one cf the basic 
problems for reinforcement theories. 

The hypotheses which we have so far 
considered assume that the habit learned 
in avoidance training is weakened on 
each occasion when the signal occurs 
without the shock. The Solomon data 
suggest that this assumption may be 
false. What about the expectancy hy- 
pothesis? Does it also make the same 
assumption? 

One postulate of the expectancy hy- 
pothesis is that when a signal is not 
followed by what it signifies, the ex- 
pectation aroused by the sign is weak- 
ened so that on the next occasion of the 
signal the expectation will be weaker, 
have a longer latency, and lower prob- 
ability of occurrence. Thus the main 
problem is to specify what, according 
to the expectancy hypothesis, is signified 
by the signal. Hilgard and Marquis 
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(1) in presenting the expectancy hy- 
pothesis appear to regard the shock as 
the signification of the signal. Given 
this interpretation, it is clear that the 
expectancy hypothesis must assert that 
when the signal occurs without the 
shock the expectation weakens, and this 
hypothesis is in no better position to 
interpret Solomon’s results than the hy- 
potheses we have already considered. 
But the Hilgard and Marquis exposi- 
tion of this hypothesis omits a charac- 
teristic of the signification which is es- 
sential. The signal signifies something 
with such and such properties in a par- 
ticular spatio-temporal location. With- 
out this double aspect of the significa- 
tion there seems to be’ no reason for 
calling the dispositions aroused by the 
signal, “expectations.” Thus should the 
signal merely arouse a preparatory dis- 
position for, say caviar, in the sense that 
the animal is now eager to eat caviar, 
wherever he finds it, then there is no 
rationale for saying that this disposition 
is true or false. This disposition is 
simply an appetite for caviar. But 
when this appetitional disposition is 
combined with a spatial disposition so 
that when the signal is given the ani- 
mal is prepared for caviar in a specific 
place and is not prepared for it in other 
places, then we may say that this com- 
pound disposition is true if caviar is in 
that place, and false if it is not. Such 
compound dispositions I call “expecta- 
tions.” They are strengthened by find- 
ing the signified object in the signified 
location. They are weakened by not 
finding that object in that location. 
Thus in a simple T-maze problem the 
expectation of food at L, is strength- 
ened by finding food at L, and weak- 
ened by not finding food at L,. It is 
unaffected, however, by either (1) find- 
ing food at Lg, or (2) not finding food 
at Lg, where Lz is any place which is 
discriminable from L,. In the two lat- 
ter cases only the strength of the ex- 
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pectation of food at Lg is changed. 
Consideration of these statements makes 
it clear that the behavior at the choice 
point cannot be taken as unequivocal 
evidence for the assertion that the ani- 
mal expects food at location A, since the 
behavior at the choice point is deter- 
mined by a variety of factors including 
the relative strengths of expectations of 
food at L, and Lg. Instead the behav- 
ioral test for such expectations must be 
vased, as Tolman has pointed out, upon 
what the animal does at L, and Lg. In 
discussing M. H. Elliott’s experiments 
on the effect of changes in the goal, 
Tolman says: “On the trials immedi- 
ately after the change, Elliott observed 
a disrupted, searching sort of behavior. 
. . . The point we would make is that 
such ‘searching,’ i.e., disruption, is to be 
taken as the empirical evidence for, and 
definition of, an imminent expectation 
of the previously obtained bran mash” 
(4, p. 74). In experiments at the Uni- 
versity of California precise behavioral 
tests for such expectations may enable 
us to measure expectation-strength in- 
dependently of the animal’s behavior at 
the choice point. When this has been 
accomplished, we will be in a position 
to discover how and when such expecta- 
tions affect choice point behavior. 

Let us return now to a consideration 
of the application of the expectancy hy- 
pothesis to avoidance learning. What 
happens to the expectation when the 
animal responds to the signal and so 
succeeds in avoiding the shock? Ac- 
cording to the present statement of the 
expectancy hypothesis the signal signi- 
fies shock at a specific place L4. The 
animal is located at L, at the time the 
signal is presented. As a consequence 
of the expectation aroused by the sig- 
nal the animal responds by leaving La 
and moving to a different place L». 
The animal receives no shock at Lp. 
What effect should this consequence 
have upon the animal’s expectation of 
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shock at La? None! The only way of 
weakening the expectation of shock at 
L, is to have the animal remain at Ly, 
and fail to get a shock. As long as the 
animal moves from the region of the 
expected shock, the expectation must 
remain at its original strength. This 
hypothesis, then, will account for the 
fact, if it is a fact, that sometimes the 
response in avoidance learning never 
extinguishes. How can this hypothesis 
account for the fact that such respoases 
often do extinguish? 

It will be remembered from the pre- 
ceding discussion that according to the 
present view the behavior at a choice 
point cannot be taken as unequivocal 
evidence for the assertion that an ani- 
mal has a particular expectation. Many 
factors determine the choice point re- 
sponse and the expectation is only one 
of them. Now the same considerations 
apply to the avoidance learning prob- 
lem. Thus although the animal may 
have the expectation at full strength, 
he may fail to make the avoidance re- 
sponse. Either fatigue or distraction or 
both might on one occasion inhibit the 
response. In such a case if no shock is 
given at L,, the expectation will then 
be weakened. Then on the next oc- 
casion when the signal is given, a 
weaker expectation will be aroused and 
the probability of an avoidance re- 
sponse will be less. From then on, as 
each failure to respond is followed by 
no shock at Ly, the expectation is con- 
tinuously weakened until it disappears 
entirely and as a consequence the re- 
sponse is extinguished. 

Because there has been relatively little 
research upon the extinction of avoid- 
ance learning, there is at present no 
strong evidence for the expectancy hy- 
pothesis. On the contrary, there is 
some evidence in the Solomon experi- 
ment which appears to contradict the 
present hypothesis. I refer to the fact 
that Solomon was unable to produce 
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extinction of the avoidance response 
when he locked the normal dogs in the 
chamber and the signal was not fol- 
lowed by shock. If this procedure pro- 
duced on each occasion a decrement in 
the expectation of the shock at La, as 
it should if the animal was expecting 
shock in that specific place, then ac- 
cording to the present hypothesis, the 
avoidance response should gradually 
extinguish. The basic question, then, 
is whether or not this procedure in 
Solomon’s apparatus does weaken the 
expectation. We have at present no 
independent evidence that it did. Are 
there any reasons for supposing that it 
may not have had this effect? I be- 
lieve there are. 

It should be realized that Solomon’s 
dogs in the original training received 
shocks in compartment A and compart- 
ment B equally often after the same 
signal. Because of this, and lvecause 
both compartments were similar in 
character, it seems reasonable to sup- 
pose that the expectations aroused by 
the signal were ambiguous. Thus fail- 
ure to receive shock in compartment A 
after the signal might not affect the 
strength of such an ambiguous expecta- 
tion. What is needed to settle this 
question is evidence of the effects of 
this training upon the animal’s expecta- 
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tions, evidence which is independent of 

the avoidance response. 

It was emphasized earlier that if 
the Miller-Mowrer hypothesis were to 
become a satisfactory explanation of 
avoidance learning, independent evi- 
dence of the behavior of learned fear 
responses must be obtained. A similar 
problem faces the expectancy hypothe- 
sis. What is needed, here, before one 
can regard it as a satisfactory explana- 
tion, is independent evidence of the 
growth and decline of expectations. Be- 
cause the avoidance learning problem 
forces us in this way to clarify our no- 
tions I regard it as one of the most 
fruitful areas for research in learning 
theory. 
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